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Preface 

Basic Statistics has been written to serve as the text in the Intermediate course 
in Statistics for the students of Part-I, The book has been written according to the 
new syllabus approved by the National Bureau of Curriculum and Text Books, 
Ministry of Education, Govt, of Pakistan, Islamabad. Thus the book would be equally 
useful for students of all the Education Boards in Pakistan, It has been written 
especially for Intermediate students but the students of B,Com; M.A, Economics, 
M.Sc. Psychology, M.Sc. Geography, Business Administration or any other area of 
social sciences can also benefit from this book. For beginners interested In learning 
the basics of the subject, this book should indeed prove to be a valuable gift. We 
would not like to make tall claims at this stage. We would rather leave it to the 
subject teachers and the students to evaluate the book and form their own 
independent opinions. We, on our part, are glad to say that we have fulfilled a long 
outstanding demand of our ex-students for such an undertaking. 

The book is the product of combined efforts of the youthful energies and long 
teaching experience of B.Sc, B.S and M.Sc, classes of one author and long teaching 
experience of the other, Both the authors are proud of their sustained and successful 
efforts in bringing out a book dedicated primarily to the problems faced by the 
subject teachers as well as the students. Some typical features of this book are:- 

(i) We have stressed on the solved examples so that the students may easily 

learn the subject by studying the examples. 

(ii) Theory has been written in simplified language. It would seem as if the 
teachers were speaking to the students and the simple situations of 
practical life were being explained. 

(ili) Diagrams have been used for effective illustration. 

(iv) Exercises contain some questions which are different from the age-old 
pattern of questions. New questions have been added to invoke fresh 
thinking. Students have been encouraged to generate their own exercises 
for better understanding and assimilation of the subject, 

(v) Short Definitions, Multiple Choice Questions and Short Questions are also 
included which are according to new syllabus and which will be helpful for 
students to have a command on the subject c 

Lot of pains have been taken in proof reading : 7 do not claim that the book 
is free of typing/printing errors. There may be errors the part of the authors, We 
apologize for any inconvenience caused on this account. We shall be grateful if any 
such errors or omissions are brought to our notice. For the improvement of the book, 
we would welcome any suggestions or criticism particularly from our friends in the 
teaching community. b 

We are thankful to our relatives, friends and colleagues who encouraged us for 
writing this book. We have special words of thanks and appreciation for Muhammad 
Khurshid Khan, the computer Operator who worked with us with devotion, 

Majeed Book Depot, Urdu Bazar Lahore, our publishers, have the credit of 
bringing out the book in time despite acute limitations. We owe extra special thanks 
for them. 


September 1, 2012 Muhammad Saleem Akhtar 
Ghulam Hussain Kiani 
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Chapter 


INTRODUCTION 


1.1. HISTORY 

Statistics is a very old word. It is from the English language. Nothing is known 
about the exact origin of the word. It is written in the books on statistics that this 
word perhaps been derived from some other language. For example there is a word 
Status in the Latin and in the Italian there is a word statista. Another word is 
statistik in the German language, All these words mean "the political state". 
Shakespeare used a word statist in his drama Hamlet (1602). In the ancient times 
the statistics was used by the rulers. The application of statistics was very limited. 
It was nothing but the record of armed forces, labour force, agricultural land, area 
and population etc, The records of human beings were very important to the kings. 
There is a lot of gap between the old statistics and the modern statistics. But the old 
statistics also makes a part of the present statistics, The English writers have used 
this word in eighteenth century in their works, W, Hooper used the word statistics in 
1770 in his translation of Elements of Universal Erudition, a famous old book 
written by Baron B.F. Biefeld. In this book one Chapter is on statistics, which has 
been defined as "the science that teaches us what is political arrangement of all the 
modern states of the known world". E.A.W. Zimmermann has defined the word 
statistics in 1787 as "that branch of political knowledge which has for its object the 
actual and relative power of the several modern states, the power arising from their 
natural advantages". Statistics has developed gradually during the last few 
centuries. Most of the work in statistics has been done in the end of the nineteenth 
and the start of the twentieth century, In 1875, a German physicist F.R. Helmert 
(1834 — 1917) developed a method called chi-square. This method brought a big 
revolution in statistics. 

In 1909, Karl Pearson ( 1857 — 1936 ) worked on the chi-square and suggested 
a very useful test called goodness-of-fit test. Sir Francis Galton (1822 — 1911) worked 
on Regression and suggested the techniques which are used in modern statistics. A 
very famous and useful method of Analysis of Variance (ANOVA) was developed in 
1923 by Sir R.A. Fisher (1890 - 1962). The modern statistics in its present form is 
not only helpful to the rulers, but is also helpful to the experts working in different 
fields of life. Knowledge of statistics has become important for a common man. A 
person without a reasonable knowledge of statistics is not a literate person. Modern 
statistics is being applied in various fields of life. It increases our wisdom to 
understand the matters of practical life. A man equipped with the knowledge of 
statistics is a better ruler, a better policy maker and a better administrator, 
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1.1.1, OBSERVATION 


In statistics, numerical measure of some condition is called an observation, "Mr 
Ais a rich person", It is not an observation because the income of Mr. A has not been 
mentioned numerically. Income of Mr. A is Rs.6000 per month is an observation. Mr. 
B is intelligent, he is hardworking, he is very kind, he is a good player, he is honest. 
None of these statements is an observation. The work on statistics may be called 
statistical work. The statistical work starts with a set of observations. The 
observations may be about the wages of workers in factories, about prices of 
commodities in a market, about marks of students, about the land holdings of 
different farmers, the number of accidents per day on the roads, the daily production 
of a factory and so on, Thus we start with observations, Observations are the take off 
point for statistical work. Observations serve as raw material for statistical work. 
We take the observations by counting or by measurement. The number of patients in 
different hospitals are counted and their blood-pressures are measured. 


Weights of commodities consumed, weights of students, neck sizes of students, 
chest sizes of players are all result of measurements. Measurements are never exact. 
A weight of 60 kg. means that the weight lies between 59.5 and 60.5 kg. A weight of 
60.5 kg. means that it lies between 60.45 and 60.55 kg. The number of students in 
different sections are counted. Counting generates exact figures. 

1.1.2. POPULATION 

The word population or statistical population is used for all the individuals 
which possess some common characteristic and we are interested to know some 
properties of those individuals. The individuals may be called the study units. The 
study units may be living or non-living things. We may have to study a population of 
fans produced in a factory during certain period, we may have a population of fruit 
trees in a garden, a population of children in primary schools, population of patients 
in hospitals, population of beggars in a country, population of salaried persons in a 
country, population of fish in an ocean, population of cows in a country. A population 
may be finite or infinite. It is called finite if its individuals can be counted. For 
example the population of ihcome-tax payers, the population of smokers in a 
country. The population is called infinite when its individuals cannot be counted. 
For example the number of sun-rays and the number of dust particles on the surface 
of earth. The number of fish in an ocean may be very large but countable, The 
number of stars in a galaxy may be countable. Such populations are called countably 
infinite (very large but countable). The size of the population is denoted by N. If we 
have 500 individuals in a population, we write N = 500. The size of the population is 
usually very large. 

11.3. SAMPLE 


Sample is any part of the population which is selected from the population. The 
sample is studied and on the basis of the sample study, we try to know something 
about the population. This journey from the sample to the population is called 
inference which is an important branch of statistics. The method of selecting sample 
from the population is called sampling. The sample size is denoted by n. The ratio 
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& is called sampling fraction. Sample is studied to know the properties of the 


population, The sample study saves time and finances and in general the sample 
study provides results of better quality, In statistical studies, most of the work is 
done on sampling basis, The medicines, seeds, fertilizers and other products are first 
prepared on small scale and after approval, they are manufactured on large scale. In 
subsequent Chapters, we shall make many numerical calculations. All calculations 
will be about samples. Whenever there is a calculation about the population, it is 
mentioned clearly. 
1.1.4. PARAMETER AND STATISTIC 

Any measure of the population is called a parameter. Suppose we are 
interested to know the percentage of smokers in our country. This percentage 
pertains to the population of smokers and is called parameter. It is usually 
unknown. The symbol j(mue) is used for population mean and the symbol o*(sigma- 
square) is used for population variance. These symbols pertain to the population and 
are called the parameters. The parameters are usually unknown and are estimated 
through samples. Population parameter is a fixed quantity and may be called a 
constant. 

Any measure of the sample is called statistic, The statistic depends upon the 
sample. Different samples have, in general, different values of the statistic. The 


symbol X is used for sample mean and S? is used for the sample variance. These are 
the statistics (plural for statistic). The sample statistics are used to draw conclusions 
about the population parameters. The process of drawing these conclusions is called 
inference or inferential statistics. 
1.2, MEANING OF STATISTICS 

The word statistics has three different meanings which are discussed below: 
(a) Statistics in plural sense 

The word statistics in plural sense are the numerical observations collected for 
some definite purpose regarding some field of study. These observations may be for 
the sample or the population. These observations are also called data. The 
observations about the wages of workers are called the statistics of wages. The 
prices of commodities taken from the market are called statistics of prices. It is 
usually said that the statistics of education and agriculture are poor in our country. 
This statement means that we do not have the correct information about the school- 
going children, about the failures, about the drop-outs, about the results and about 
the unemployed educated persons, We have poor agricultural statistics because we 
do not have correct figures about the production and consumption of different crops, 
fruits and vegetables, One can talk about one's personal statistics @.g., age, weight 
education, income and children etc. We may say "he is a man of good statistics". It 
means that he has a good fortune. He has wealth, house and other facilities. 
(b) Statistics in singular sense 

Statistics in singular sense is a body of methods used in the collection, 
presentation, analysis and interpretation of data. The word scientific method is also 
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used for the steps which are taken to study the data. This meaning of the word is 
close to the word statistics as a subject. 
(c) Statistics as plural of Statistic 

Statistic is any measure of the sample. The proportion of smokers in a sample 


may be denoted by > . It is a statistic, We can find the mean X of the sample. It is 
also a statistic, When we have more than one statistic, we shall use the word 


statistics as plural for statistic. The sample mean X and the sample proportion D 
are the sample statistics. 


1,3, CHARACTERISTICS OF STATISTICS 


The word Statistics in plural sense has some characteristics which are 
discussed below. 
(i) Statistics are aggregate of facts 

A single observation does not make statistics. The income of Mr. A is 
Rs.20000/- per month is not statistics, The observations must be in a group 
expressed in proper units if applicable. The marks of some students may be 20, 30, 
40, 50, There is no unit of measurement. But if we talk about prices of meat in 
different markets, we must write the units i.e., 'Rs.' or '£' with the observations. The 
prices like 50, 55, 60, 62 do not make statistics. The figures as Rs.50, 55, 60 and 62 
are statistics provided they have been collected for the purpose of some study. 
(ii) Statistics are numerically expressed 

Only numerical statements make statistical data. The statements like "he is a 
good player" or "he is a poor person" are not the numerical statements. Income of 
Mr. A is Rs.8000 per month and that of Mr. B is Rs.9000 per month are the 
numerical statements. These numerical statements are the statistics of the income 
of Mr. A and B. 
(iii) Statistics are affected to a marked extent by multiplicity of causes 

Every observation is the net result of many forces. There are many forces 
which exert their influence in the field from which the observation is to be taken. 
The marks of a student depend upon his hard work, his intelligence, the intelligence 
of his teachers, the devotion of his teachers, the financial and the educational level 
of his parents, the locality in which he is living, the transport facilities and many 
other factors. Some of these forces are more powerful than others. The marks of a 
student may be high due to his hard work or due to his intelligence. The marks of 
some other student will be affected by another set of multiplicity of causes. Thus the 
observations (statistics) are affected to a great extent by many forces. 
(iv) Statistics are enumerated or estimated according to a reasonable 

standard of accuracy 

The measurements are never exact. There is nothing like exactness in nature. 
Suppose we take an observation on the weight of a player. His recorded weight is 70 
kg. Is it exactly 70 kg? No it is never exactly 70 kg. The weight lies somewhere 
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between 69.5 to 70.5 kg. It is only for practical’ purposes that we have recorded the 
weight as a whole number. We may make the observation on weight and our 
observation may be 70.2 kg. Is it exactly 70.2 kg? No, it is somewhere between 70.15 
to 70.25 kg. When we are interested in the weights of the players, it will be 
meaningless to take the observations up to 4 places of decimals in terms of kgs. 
Sugar consumed by consumers on daily basis may be measured in gms. The amount 
of iron in the body of human beings may be measured in milligrams. Consumption of 
meat by consumers per day will not be measured in tons or milligrams. The suitable 
measure will be 'kg'. The roads are not measured in cms and the length of rooms is 
not measured in kms. We take the observations according to a reasonable level of 
accuracy, When we make a measurement, we get an estimated figure and not exact 
figure. 

Enumeration means counting. The population of big countries is counted in 
millions, the population of small cities may be counted in thousands and for the 
population of villages the actual figures may be used. It may be noted that in the 
counting process the exact figures are possible but in measurements the exact 
measurements are never possible, Depending upon the situation, we use suitable 
units of measurement or counting. 

(vy) Statistics are collected in a systematic manner 

A systematic method of collection of statistics is adopted. Suppose we want to 
carry out a study about the guality of primary education in our country. First we 
shall make a list of all the primary schools, a sample of some schools will be selected. 
In the selected schools, we shall make lists of the students and teachers. The 
qualifications of teachers and the ages of students will be recorded. The scores of the 
students in the house examinations will be taken. All this work is to be done in a 
systematic manner. 
(vi) Statistics are collected for a pre-determined purpose 

The mere figures which do not serve any purpose are not statistics. Those 
figures make statistics which have been collected to carry out some study, survey or 
inquiry. When we take the price figures from the market to see as to how much 
increase/decrease has taken place in the level of prices, then the price figures are 
called statistics. These figures are to be used for a definite object in mind. 

(vii) Statistics must be comparable to each other 


Observations taken from one place should be comparable with the observations 
taken from some other place. Suppose we have to carry out a study about the 
literacy level in different provinces of our country. In the Punjab the definition of 
literacy may be different from the definition of literacy in the Baluchistan. If the 
definitions are different the statistics of literacy will not be comparable. Similarly in 
a study on housing facilities, a house in the N.W.F.P. may have a different meaning 
from a house in the Punjab. A farmer of the Sindh may not compare with a farmer of 
the Baluchistan. Statistics taken at different times should also be comparable. In 
the 1972 population census of Pakistan, the definition of a literate person was "a 
person who can read and write a simple paragraph in any language" and in 1981 
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population census the definition was "2 person who can read or write a simple 

paragraph in any language". Obviously the definitions were different. Thus the 

statistics of literacy in 1972 and 1981 are not comparable. The definitions should be 
consistent. 

Horace Secrist has given a comprehensive statement which includes all the 
characteristics described above. His statement is "statistics are aggregate of facts, 
affected to a marked extent by a multiplicity of causes, numerically expressed, 
enumerated or estimated according to a reasonable standard of accuracy, collected in 
a systematic manner for a predetermined purpose and placed in relation to each 
other". 

The above statement is a detailed definition of statistics in plural sense but 
this definition does not include the singular meaning of the word statistics. Thus 
this statement is not the complete definition of the word statistics. 

1.4. DEFINITIONS OF STATISTICS 
Statistics like many other sciences is a developing discipline. It is not 

something static. It has gradually developed during the last few centuries. In 

different times, it has been defined in different manners. Some definitions of the 
past look very strange today but those definitions had their place in their own time. 

Defining a subject has always been a difficult task. A good definition of today may be 

discarded in future. It is difficult to define statistics in a clear and acceptable 

manner. Some efforts which have been made in the past for defining statistics. Some 
of the definitions are reproduced here: 

(i) The kings and rulers in the ancient times were interested in their manpower. 
They conducted census of population to get information about their population. 
They used this information to calculate their strength and ability for wars. In 
those days statistics was defined as 

"the science of kings, political arithmetic and science of 
statecraft”. 

Today this definition may scem very strange but it is true that our Govts. use 

statistics. All the future planning is based on statistics. 

(ii) A.L. Bowley defined statistics as 

"statistics is the science of counting". 

This definition places the entire stress on counting only, A common man also 

thinks as if statistics is nothing but counting. Statistics today is not mere 

counting of people, counting of animals, counting of trees and counting of 
fighting forces. It has now grown to rich methods of data analysis and 
interpretation. 

(iii) A.L. Bowley has also defined it as 

"science of averages" 


This definition is very simple but it covers only some area of statistics. Average 
is very important in statistics. Experts are interested in average death rates, 
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average birth rates, average increase in population, average increase in per 
capita income, average increase in standard of living and cost of living, average 
development rate, average inflation rate, average production of rice per acre, 
average literacy rate and many other averages of different fields of practical 
life. Large number of experts are involved in the calculation of different 
averages. Thus ‘average! is an important calculation. But statistics is not 
limited to averages only. There are many other statistical tools like measures 
of variation, measures of correlation, measures of independence etc. Thus this 
definition is weak and incomplete and has been buried in the past. 

(iv) Prof, Boddington has defined statistics as 

"science of estimates and probabilities" 
This definition covers a major part of statistics. It is close to the modern 
statistics. But it is not complete because it stresses only on probability. There 
are some areas of statistics in which probability is not used. 

(v) A definition due to W.I. King is "the science of statistics is the method of judging 
collective, natural or social phenomena from the results obtained from the 
analysis or enumeration or collection of estimates". This definition is close to 
the modern statistics, But it does not cover the entire scope of modern 
statistics. Secrist has given a detailed definition of statistics in plural sense. 
His definition is given on the previous pages. He has not given any importance 
to statistics in singular sense. Statistics both in the singular and the plural 
sense has been combined in the following definition which is accepted as the 
modern definition of statistics. 

"statistics are the numerical statements of facts capable 
of analysis and interpretation and the science of 
statistics is the study of the principles and the methods 
applied in collecting, presenting, analysis and 
interpreting the numerical data in any field of inquiry". 

1.5. VARIABLE AND CONSTANT 
The word variable is used for something which can take different numerical 

values for different individuals or objects. On different shops the prices of apples 
may be Rs.30, 32, 30, 35, 36, 32 per kg. This set of numerical values is called the 
variable, If apples are being sold at Rs,32 on all the shops, then we have a fixed price 
and we say that the price is constant. It does not change from shop to shop. Different 
patients may have different blood pressures but they may have same level of 
temperature. Thus blood pressure is variable and in this example temperature is 
constant. Different workers may have different wages but their medical allowance 
may be fixed. Thus wages are a variable and medical allowance is a constant. For 
variables we shall use the symbols x, y and z and for constants we shall use the 
symbols 'a’, 'b' and 'c' etc. 

1.5.1. DISCRETE VARIABLE 
A variable is called discrete if it can take only some sclected values in a given 

interval. Suppose in a certain town, the minimum number of family members is 1 
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and the maximum number of family members is 7. No family will have 1.6 members. 
The number of family members must be 1, 2, 3, 4, 5, 6 or 7 and no other value 
between 1 and 7. There is a jump from 1 to 2 because we can think of any number of 
values between 1 and 2. This variable cannot take any value between 1 and 2. This 
variable is discrete and can take values with jumps. When we select 5 bulbs from a 
factory, the number of good bulbs may be 0, 1, 2, 3, 4 and 5. This is a discrete 
variable. If 30 students appear in an examination, the number of passing students 
may be 0, 1, 2, ..., 30. It is a discrete variable. Discrete variable is generated by 
counting or enumeration. It may also be called count variable. A set of observations 
on discrete variable is called discrete data. 


1.5.2. CONTINUOUS VARIABLE 


A variable is called continuous if it can possibly assume all values in a given 
interval. Suppose the minimum temperature of a city in summer is 35°C and 
maximum temperature of the season is 45°C. The temperature can take any value 
between 35°C and 45°C. It may be 35, 35.01, 35.02, 35.1, ..., 45°C. The observed 
values of temperature on different days may be 35, 41, 42 etc. but as far as 
possibility is concerned, the variable can take any possible value in the given range. 
Examples of continuous variable are heights of students, amount of rainfall on 
different days, the consumption of sugar per family etc. A continuous variable is 
generated by measurements. It may also be called a measure variable. A set of 
observations on a continuous variable is called continuous data. 


1.5.3. QUANTITATIVE AND QUALITATIVE DATA 


A set of observations generated by counting (enumeration) or by measurements 
is called quantitative data. Both discrete and continuous data come under the 
quantitative data. The data on the number of accidents on roads, the number of 
teachers in different schools, the intelligence quotients of the students, the 
temperature in different cities, the consumption of electricity in different houses are 
quantitative data. 

The word qualitative data is used for that information which is generated by 
observing the presence or absence of some quality (called attribute) in individuals. 
The individuals may be the students and the quality may be he intelligence. We 
examine whether cr noi intelligence is present in the students. .‘he observations are 
recorded with the help of 'yes' for intelligence and 'no' for non-intelligence. Thus 'yes' 
or 'no' generate the observations and ihe data thus obtained are called qualitative 
data. Some examples of qualitative data are given below: 


@) Number of Number of Total 
intelligent students non-intelligent students 
30 40 100 
(ii) Muu ber of Number of Total 
skilled workers unskilled workers 


3000 2000 5000 
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1.5.4, ERRORS OF MEASUREMENT 

It is possible to get exact observations in a process of counting whenever 
counting is possible. The number of students in different class rooms may be 30, 35, 
46 and so on. These are exact observations. But if we measure the weights or heights 
of these students, the observations will serve our purpose but they will not be exact. 
They will be approximate. The reason is that there are the limitations of the 
observer (may be human being or some instrument) and the limitations of the 
measuring instruments. An observation may be in a whole number as 60" height, it 
may be up to one place of decimal as 60.2. It will not be absolutely exact even if it is 
measured to any number of decimal places. We can increase its accuracy by 
increasing the number of decimal points if possible on the measuring instrument but 
the exactness can never be achieved. In fact, we never need that absolute exactness 
which is only something imaginary and does not exist in nature. The difference 
between the exact value and the observed value of the observation is called error of 
measurement. It is always there whenever a measurement is made. It can be 
reduced but it cannot be eliminated completely. These errors are of two types 
namely biased and unbiased . 

(i) Biased errors: 

If the observations tend to be higher or lower than the exact observations, the 
errors are called biased. Some observer may intentionally take wrong observations 
in which case bias is called intentional or he may not be capable enough to make the 
measurements. He may be inclined in one direction. A teacher may award marks to 
the students less than what they deserve. It is called one-directional bias. The errors 
involved are also called systematic or cumulative errors. These errors are not 
reduced by increasing the number of observations. 

(ii) Unbiased errors 

Unbiased errors occur when the observer makes the observations some of 
which are higher than the exact observations and some are below the exact 
observations. Some errors are positive and some are negative with the result that 
they cancel out each other when they are added. These errors are, therefore, called 
compensating errors. They are also called random or accidental errors. 

1.6. BRANCHES OF STATISTICS 

The science of statistics can be divided into following branches: 

(@) Theoretical statistics, Gi) Descriptive statistics, 

(iii) Inferential statistics, (iv) Applied statistics. 

1.6.1, THEORETICAL STATISTICS 

This branch of statistics deals with the formulas and rules which are used in 
statistical work. These rules are used in descriptive as well as inferential and 
applied statistics. 

1.6.2. DESCRIPTIVE STATISTICS 

This branch of statistics covers the methods and principles of collecting 

observations, their condensation and finding the salient features of the data. It also 
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covers graphic presentation. Some basic calculations are made on the data to find 
out its properties. Suppose we make a graph of the imports of Pakistan during the 
last 10 years. We can read the graph to see as to what has been the growth of 
imports in the past. This study belongs to descriptive statistics. 

1.6.3. INFERENTIAL STATISTICS 


This is the branch of statistics which deals with procedures of drawing 
inferences about the population on the basis of the information obtained from the 
sample. This is a very important branch of statistics. The knowledge of probability is 
used to make inferences about the population with the help of the sample study. 
Suppose we want to have an idea about the percentage of smokers in our country. 
We take a sample from the population and the proportion of smokers in the sample 
is calculated, This sample proportion with the help of probability enables us to make 
some inference about the population proportion. This study belongs to the area of 
inferential statisticg. 

1.6.4. APPLIED STATISTICS 

The statistical knowledge is used in different fields of life. It is used in the 
fields of business, economy, biometry, demography, agriculture, education, banking, 
insurance etc. The statistical wisdom helps us to understand the data in various 
fields of life. Suppose we take the prices of various commodities from the market. We 
can calculate the index numbers of prices to see the changes in the level of prices. 
This is a very important study for the consumers, for the businessmen and for the 
Govt. Such studies fall in the area of applied statistics. 

1.7. FUNCTIONS OR USES OF STATISTICS 
(i) It presents facts in a numerical form 


An important feature of statistics is that it deals with numerical statements. 
"We had heavy rains this year". This is a statement which may be interpreted in 
different ways by different persons. Let us make another statement. In this year the 
rainfall was 90 cm. as compared to 70 cm. during the last year. The second 
statement gives a clear picture of the amount of rainfall. This statement contains 
figures which are statistics. 
(ii) Statistics simplifies complex mass of data 

The size of data is usually very large. It is difficult for the human mind to make 
out something out of the huge amount of data. The data are converted into certain 
calculations like totals, averages and percentages etc. Suppose we have data about 
human population divided into males and females, literate and illiterate, urban and 
rural, children and adults and so on. ‘There is so much of information in this data 
that it is difficult to make some opinion about such a huge data. When the data are 
summarised by certain calculations, it becomes simple to understand it. 
(iii) Comparison of data becomes easier 

Sometimes different data are to be compared with each other. This comparison 
is not possible without the statistical tools. We apply some statistical methods which 
help us to compare these data. 
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(iv) Statistics studies relationship among different facts 
Sometime when a variable changes, it produces a change in other variables. 

Statistics helps us to carry out studies to find the relationship between two or more 

than two variables. It has been observed that when prices of oil rise, the prices of 

fruits and vegetables also rise. This indicates that relationship between two 
variables exists. Statistics helps in studying such relationship. 

(v) Statistics studies changes in a variable 
Changes in a certain system are studied by statistical methods. The population 

changes from time to time. The prices of different commodities change with the 

passage of time. There exist statistical methods for measuring these changes 

(vi) Statistics helps forecasting 
With the help of the available data, a fair idea can be formed about the future 

values of the variable. This is called forecasting. 

1.8. LIMITATIONS OF STATISTICS 
Statistics is applied in different fields of life. But it is not free from deficiencies. 

Some of the limitations of statistics are : 

(i) Laws of statistics are applicable on group of observations. Any single 
observation even if it is very important to a common man cannot be dealt with 
by statistical methods. 

(ii) Statistical methods are best applicable on quantitative data. 

(iii) Some errors are possible in statistical decisions. Particularly the inferential 
statistics involves certain errors. We do not know whether an error has been 
committed or not. 

1.9. IMPORTANCE OF STATISTICS IN DIFFERENT FIELDS 
Statistics is applied in all those fields in which data are involved. Data are 

studied with the help of statistical methods. Data analysis requires an expert 
knowledge and a common man without the knowledge of statistics cannot properly 
analyse the data, Good data may be spoiled by a person who is not expert in 
statistics. Application of statistics is very wide but we discuss here some important 
fields in which statistics is commonly employed. 

@) Business and Economics 
Statistical methods are widely used in the study of economic problems and the 

business activity. The relationship between supply and demand is studied by 

statistical methods and the decisions are supported by the observed data. The 
increasing prices are a big problem of business activity, The changes in prices cannot 
be studied by a common man. The imports and exports, the inflation rate, the per 
capita income are the problems which require good knowledge of statistics. The 

present level of prices is compared with the level of prices in the past. It helps a 

common man to have an idea about the level of prices in future. The wages of 

workers are linked with the level of prices. The problems of prices and wages are 
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very serious in our country. To study these problems, a thorough knowledge of 
statistics is required. 
(ii) Banks 

Statistics is very important to the banks. The banks do business with the help 
of deposits in the banks. The banks do not keep all the deposits with them. They do 
business with it and lend the money to the borrowers. The banks work on the 
assumption that all depositors do not withdraw their deposits on the same day. The 
bankers use statistical approach based on probability to estimate the number of 
depositors and their claims for a certain day. 
(iii) Insurance 

The knowledge of statistics is of great use in insurance business. The amount 
of premium for the insurance policies is based on life tables. 
(iv) Sciences 

In all sciences i.e., Physics, Chemistry, Biometry, Biology and Psychology, a lot 
of data are to be analysed. The analysis of data is possible only by an expert in 
statistics. 
1,10, COLLECTION OF DATA 

The first step in any investigation is collection of data. The data may be 
collected for the whole population or for a sample only. It is mostly collected on 
sample basis. Collection of data is not an easy task. For this job, the staff is trained 
and work is assigned to them. 
1.10.1. PRIMARY DATA 

The data are called primary if it is collected originally by some individual or 
agency and are original in character. The data is called primary if no statistical 
treatment is applied on the data. There are different methods of collecting the 
primary data. 
1.10.2. SECONDARY DATA 

The data are called secondary when the primary data have been treated by 
some statistical tool. If the primary data haye been treated statistically it is no more 
primary. It becomes secondary data. 
1.10.3, COLLECTION OF PRIMARY DATA 

Primary data are collected by the following method. 
(i) Direct Personal Observations 


In this method the enumerator goes to the respondents and interviews them. 
The information supplied by the informants is recorded by the enumerator. The 
information collected by this method is considered accurate and complete. The 
quality of the information (data) depends upon the two persons i.e., the respondent 
and the enumerator. This method of collecting data is very costly Pega’ a team of 
enumerators is required to get the observations. It is suitable for small local 
inquiries. It is not suitable if the area of inavzry is wide. Lot of funds and human 
resources are required for an extensive field ui 
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(ii) Indirect Personal Investigation 


This method of collecting primary data is adopted when the respondents do not 
supply the correct information. They do not cooperate with the enumerators. If 
people are asked to supply information about their income, their houses and other 
holdings, they are quite likely to give wrong information. In this method some 
agency other than the respondent is selected to supply the data. For example if data 
is required from shop keepers, their trade unions can be consulted to supply the data 
about their members. Indirect questions can also be asked from the respondent so 
that he speaks out the correct figures. The respondent can be asked about the size of 
his house, his facilities, and other services like gas, telephone, and electricity etc. By 
indirect questions the respondent can be forced to give the correct answer. 


(iii) Questionnaire Method 


Questionnaire is a list of questions relating to the field of inquiry. This list of 
questions is sent by mail to the respondents with a request that they send it back 
after duly filling the entries. This is a cheap method but this method is not 
successful in areas where people are not literate and cooperative. It is possible that 
the people do not return the questionnaire because they are afraid of giving 
information in writing. The framing of questionnaire is quite difficult. Only an 
expert can frame a good questionnaire. A questionnaire should have the following 
characteristics. 


(i) Its language should be simple. 
(ii) Questions should have an objective answer like 'yes' or 'no'. 
(iii) Number of questions should not be very large. 


This method is very popular and is being used in big cities where most of the 
people are literate. 


(iv) Collection through Enumerators 


This method is used to overcome the difficulties of the illiterate people in filling 
the questionnaire. The trained enumerators go to the informants and they assist 
them in filling the questionnaire correctly. This is a very costly inquiry and can be 
carried out by the Govt. only. 

(v) Collection through Local Sources 

These are the reports of the local representatives sent to the investigator. The 
data is not formally collected by local representatives. They only send the reports. In 
our country the Patwari sends regular reports to the Govt. regarding the crop 
conditions in our country. These reports are only the estimates based on the 
judgements. 


1,10.4, COLLECTION OF SECONDARY DATA 


The secondary data are not collected from individual units but are taken from 
different sources called secondary sources. Some of these sources are as follows: 
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(i) Official Statistics 

All the government departments maintain their data and publish it annually 
as official statistics. The important publications are those of the Statistics Division 
and Bureaus of Statistics in different provinces. 
(ii) Semi Official 

The data available with PIA, railways, State Bank of Pakistan, district councils 
etc, are called semi official. 
(iii) Secondary data can also be obtained from magazines, news papers and journals 

of the universities 
1.11 ACCURACY 

It is impossible to study statistics without an amount of arithmetic work. 
Although many statistical results are not required to a high degree of accuracy, 
there is often a risk of considerable loss of accuracy during a calculation. As a 
working rule two more significant figures should be used in a calculation than are 
required in the final answer. This means that answers derived using four figure 
logarithm tables should only be quoted to two significant figures. 
1.12 SIGNIFICANT FIGURES 
By significant figures we mean those figures in a number which give its 

information other than its magnitude. The following numbers are all given to four 
significant figures : 


When a final zero is shown in a decimal fraction, such as, 0.05290 , it must be 
taken as a significant figure. Significant figures of some numbers are given in the 
following table . 


Number : C 0.00262 6200 4.5800 
Significant 
Figures 


0.003700 | 0.0037 | 0.95 | 1.00580 | 378 | 100.00 | 0.09001 | 
‘ignificant 
—— 
1.13 ROUNDING OFF FIGURES 


When it is required to express figures to a smaller number of significant figures 
than are available they must be rounded off. The figure following the last significant: 


figure is examined. If it is less thand then the last significant figure stands as it is, 
e.g., 523 becomes 520. If it is greater than 5 or 5 followed by any further figures, it 
is increased by 1, e.g., 523.6 becomes 524 and 523.502 becomes 524. When this 
figure is exactly 5, there is a problem. Any rule such as “ round 5’s down” , or 
“round 5's up” must introduce bias, so the following rule is preferred. Round off the 
5 so that the last significant figures become even, e.g., 1975 becomes 1980 and 1965 
becomes 1960. This rule should produce as many roundings up as roundings down 
and so avoid bias. 
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Example 1.1 
Round the following numbers off to three significant figures. 
(a) 26243 (b) 2624 (c) 2626 (d) 26.24 (ec) 0.2624 (f) 0.002626 
(g) 0.005901 (h) 0.005905 (i) 0.3875 (j) 4.315 (k) 0.004500 
Solution: 
(a) 26200 (b) 2620 (c) 2630 (d) 26.2 (e) 0.262 (f) 0.00263 
(g) 0.00590 (h) 0.00590 (i) 0.388 Gj) 4.82 (k) 0.00450 
Example 1.2 
Round the following numbers off to three decimal places. 
(a) 2.6666 (b) 7.6665  (c) 9.6755 (a) 10.67 (e) 267 
(f) 0.000717 (g) 0.4295 (h) 0.97045 (i) 125.9995 (j) 43.8705 
Solution: 
(a) 2.667 (b) 7.666 (c) 9.676 (a) 10.670 (e) 267.000 
(f) 0.001 (g) 0.430 (h) 0.970 (i) 126.000 Gj) 43.870 
1.14 MATHEMATICAL NOTATION 


In addition to mathematical notations in everyday use, certain other less- 
common notations will be used; a short description of these is given for reference. 


1.14.1 MULTIPLICATION 

A cross ‘x’ as well as a stop ‘ will be used to indicate that two or more items 
must be multiplied together ; also where no confusion arises the dot may be omitted. 
For example, X log X =X x log X= X log X 

a. b=axb=ab 
1.14.2 INEQUALITIES 
The inequality signs are used with the following meanings : 
X= Y X is greater than Y 
X2yY X is greater than or equal to Y 
X<Y X is less than Y 
kK <¥ X is less than or equal to Y. 

The sense of the inequality may be remembered by noting that the open end of 
the sign is placed towards the larger item. A common use of the inequality signs is 
15 < X < 20, meaning that X has a value between 15 and 20, or 15 <X< 20, where 
the values 15 and 20 are also included. 

1.14.3 APPROXIMATIONS 

X = Y, Xis approximately equal to Y. This will be used when approximations 
are used in calculations where less accuracy is required than has been kept in the 
calculation, It implies that the values of X and Y can be regarded as equal on the 
particular occasion being considered, although they may be distinguished on other 
occasions. 
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1.14.4 LIMITS 

An arrow will indicate that a quantity tends to a certain value, and under the 
described conditions approaches closer and closer to it. For example, X > 5, X tends 
to 5, implies that X becomes closer to 5, but not that X ever equals 5. We can 
imagine that (5 — X ) is less than any real amount, however small it may be. In 
particular X — 0, X tends to zero, implies that X becomes very small; X + ©, X 
tends to infinity, implies that X becomes very large. Note that X — 0 does not imply 
that X ever becomes zero, and the size of X implied by X — will depend on the 
circumstances. 
1.14.6 MODULUS 


The sign‘ | |’ implies to take the numerical value of the quantity inside the 
sign and make it positive. For example, 
|5| = 5, |-5] =5, [15-10] =5, |10-15| =5. 


1.14.6 FACTORIALS 

If n is a positive whole number, the product 1x 2x 3x x nis called ‘ factorial 
n’ and is given by the symbol n!. So 

nl! = 1x2x3x-exn 
1x2x3=6 §l= 1x 2x3x4x5= 120. 

8! = 1x2x3x4x5x6x7x8= 40320. 

Many hand calculators have the facility for calculating factorials. Look for a 
button marked n! or X!. Zero is the only other number, a part from the positive 
whole numbers, for which there is a definition of factorial. 0!= 1. But (—3)! is not 
defined and so is meaningless. 

1.14.7 SUMMATION (2) 

We will often require the sum of a number of similar quantities; for example, 
X, + X. + X, + - + X, where n quantities of the type X are added together. This 
operation of summation is symbolised by the use of the Greek capital letter £. The 
full notation for the above sum would be 


rd 
i 


n 
p2.8= X,+X,.+Xs+...+X, 


We can see that the symbol 2 (the upper-case version of the Greek letter sigma) 
stands for the operation of summing. In future, it will help you to remember what = 
means in statistics if you remember “ capital sigma means sum”. If it is clear how 

n 
many values are to be summed we simply write =X instead of . pe So Ex means “ 
i= 


sum the X values”. Here is some other useful notation: 


=X? means square each value of X, and then sum. 
(=X)? means sum the values of X and then square the total. 
=r(X-38) means subtract 3 from each value of X, and then sum. 


=XY means multiply each value of X and Y and then sum. 
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Consider the following examples : 


n 
(i) At ht ft th = 28 
i) XP +X x= x? 
a) 1 +X, + X3 + oe LT Sa Ae 


(aii) (Mi + Yi) + (Ke + Ya) + (Kat Ya) + + (Xa Ya) = E (Kit %) 
4 
(iv) COP SER Kat BE CME SECs )PE ZO Fr 3) 
N 
(v) (ayb1)+ (agbg) + (agbg) + +++ + Canby) = ; 2 Caibi) 


(vi) ORa LAG a 2088 Ke 8B Kod jak (Ks — 5)" y 


o 


(vii) aX, +a2X,+ a3Xj + adX, + a5X, rs ZL aiX; 


1 
n 

(viii) [(Xi-A) +4 -A) HOA) EAE, E 

(ix) f)Xyt X_+ Xs +--+ Xn = . 2. FX, 

(x) atatat +--+ a(ntimes) = na 

(xi) ayt+ ag + ast ta, = . 5. aj 


n 
(xii) CaRre bt (aka) (ake b ah Cake bh) = 2 (aXitb). 


10 
Gail) (Yi — a )# fC Yp— a) fy Ys—a 2+ vet fio Yio-a P= =z, f(%i- a) 


Example 1.3 


The following corresponding values of X and Y have been obtained : 


3 


1 
12 | 10 | 6 
Calculate (i) EX Gi) EY (iii) DX? (iv) ZY? (v) EXY 
(vi) (ZX)? (vii) (ZX) (TY) (viii) B |X - Y| (ix) = XY? 
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Solution: 
(i) =X =14+3 +4+5+7+9+12+14 = 55 
(ii) TY = 12+10+6+9+4+0-1-5 = 35 
(iii) 2X? = (1)? + (3)? + (4)? + (6)? + (7)? + (9)? + (12)? + (14)? 
= 1+ 9+16+ 25+ 49 + 81+ 144 + 196 = 521 
(iv) ZY? = (12)? + (10)? + (6)? + (9)? + (4)? + (OF + C1)? + (5)? 
= 144+ 100 +36+ 81+ 16 + 0+ 1 + 25= 403 
(vy) EXY = 1(12) + 3(10)+ 4(6)+ 5(9) + 7(4) + 9(0) + 12(-1) + 14(-5) 
= 12+ 30 + 24 + 45 + 28+ O- 12 - 70 = 57 
(vi) (2X)? = (55)? == 3025 
(vii) (EX) (ZY) = (65)(35) = 1925 
wii) 1X —Y| = | 1-12] + |3-10]+|4-6]+ |5-9| +|7-41 + |9-0] 
+|12+1] +|14+5] 
=1ll+7+2+4+3+9+ 13+ 19= 68 
(ix) EXY? = 1(12)? + 3(10)2+ 4(6)?+ 5(9)? + 7(4)? + 9(0)? + 12(-1)? + 14-5)? 


ul 


144+ 300 + 144 + 405 + 112+ 0+ 12 + 350= 1467 
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SHORT DEFINITIONS . 
Observation 
Anything that can be measured or observed is called an observation. 
{ Data) 


bers or measurements that are collected as a result of observations. 
or 
The facts and figures that are collected, analyzed and interpreted, 
Apopulation is the set of all units of interest in a particular study, 
or 


The complete set of all objects possessing some common charactoristic of interest is 
called a population. 


- Asample is a subset of data selected from the population. 
or 


A sample is a part of the population that is under study. 


Parameter 


A parameter is a numerical measurement describing some characteristic of a 
population. 

or 
A parameter is a value associated with a population. 


Statistic i 
Statistic is a value computed from a sample. 
or 


A statistic is a numerical measurement describing some characteristic of a sample. 
atistics is the science of collecting, organizing, presenting, analyzing, and 
interpreting data to assist in making more effective decisions. 

or 
A body of methods dealing with the collection, description, analysis and 
interpretation of information that can be given in numerical form. 
Characteristics of Statistics 
Statistics are aggregate of facts, affected to a marked extent by a multiplicity of 
causes, numerically expressed, enumerated or estimated according to a reasonable 
standard of accuracy, collected in a systematic manner for a predetermined purpose 
and placed in relation to each other. 
Objective of Statistics 


The objective of statistics is to make inferences about a population based upon 
information contained in a sample. 
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A rs is a phenomenon that may vary from one individual or object to another. 


or 


Any characteristic of a person, group, or environment that can vary or differ. 


a aantity which is fixed is called a constant. 


or 
A value that is unchangeable is called a constant. 
Quantitative Data 
Quantitative data are observations measured on a numerical scale. 
or 


Quantitative data are always numeric and indicate how much or how many of 
something. 


Qualitative Data 
Qualitative data are observations that are nonnumerical. 
or 
Data that provide labels or names for a characteristic of an element. 
iscrete Data 
Data whose possible values are countable is called discrete data. 


Continuous Data 


possible values are uncountable and which may assume any value in an 
interval, is called continuous data. 


When the variable to be studied can be reported numerically, the variable is called a 
quantitative variable. 


When the characteristic being studied cannot be recorded in numerical form, it is 
called a qualitative variable. 
Discrete Variable 
A discrete variable is one that can assume only certain values within an interval. 
or 
A discrete variable is limited to some certain numerical values. 
Continuous Variable 
A continuous variable can take on all values within a specified range. 
or 
A variable that can assume an unlimited number of intermediate values between a 
given range. 
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Independent Variable 
A variable is called an independent variable if it is not influenced by another 
variable, 

or 
A variable that provides the basis for estimation, is called independent variable. 
Dependent Variable 
The variable that is being predicted or estimated, is called dependent variable. 

or 
A variable whose values are thought to be a function of the values of independent 
variable, is called a dependent variable. 
Descriptive Statistics 
Descriptive statistics attempt to present data in numerical, graphical or tabular 
form in order to convey information clearly. 

tive. celle desea etl Ms 3 
Descriptive statistics consists ane methods for organizing and summarizing 
information in a clear and effective way. 
Inferential Statistics 
inferential statistics consists of methods of drawing conclusions about a population, - 

based on information obtained from a sample. 

or ca 
The branch of statistics concerned with using sample data to make an inference 


ata are those which are collected for the first time and are thus original in 


or 
Data that are collected by any body for some specific purpose and use, are called 
primary data 
Secondary Data 
secondary data are those which have already been collected by some other person 
and which have passed through the statistical process at least once. 

or 
Data that are collected and compiled by any other source, is called secondary data. 


~\ 
MULTIPLE - CHOIC 


1. The science of collecting, organizing, prasenting, analyzing and interpreting 
data to assist in making more effective dedisions is called: 
(a) statistic (b) \ parameter 
(c) population (da) 
2. Methods of organizing, summarizing, and presenting data in an informative 
way is called: 
(a) descriptive statistics (b) inferential statistics 
(c) theoretical statistics (d) applied statistics 


10. 


11. 


12. 


13. 


14, 


15. 


— ——— — 
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The methods used to determine something about a population on the basis of a 


sample is called: 

(a) inferential statistics (b) descriptive statistics 
(c) applied statistics (a) theoretical statistics 
When the characteristic being studied is nonnumeric, it is called a: 
(a) quantitative variable (b) qualitative variable 

(c) discrete variable (d) continuous variable 
When the variable studied can be reported numerically, the variable is called a: 
(a) quantitative variable (b) qualitative variable 

(c) Independent variable (a) dependent variable 

A specific characteristic of a population is called: 

(a) statistic (b) parameter 

(c) variable (da) sample 

A specific characteristic of a sample is called: 

(a) variable (b) constant 

(c) parameter (d) statistic 

A set of all units of interest in a study is called: 

(a) sample (b) population 

(c) parameter (d) _ statistic 

A part of the population selected for study is called a: 

(a) variable (b) data 

(c) sample (d) parameter Z 
Listing of the data in order of numerical magnitude are called: 

(a) raw data (b) arrayed data 

(c) discrete data (d) continuous data 
Listing of the data in the form in which these are collected are known as: 
(a) secondary data (b) raw data 

(c) arrayed data (d) qualitative data 


Data that are collected by any body for some specific purpose and use are 
called: 


(a) qualitative data (b) primary data 

(c) secondary data (d) continuous data 

‘The data which have under gone any treatment previously is called: 
(a) primary data (b) secondary data 

(c) symmetric data (d) skewed data 

The data obtained by conducting a survey is called: 

(a) primary data (b) secondary data 

(c) continuous data (d) qualitative data 

The data collected from published reports is known as: 

(a) discrete data (b) arrayed data 


(c) secondary data (d) primary data 
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16, A survey in which information is collected from each and every individual of * 
the population is known as: 


(a) sample survey (b) pilot survey 
(c) biased survey (d) census survey 
17. Data used by an agency which originally collected them are: 
(a) primary data .  (b) raw data 
(c) secondary data (d) grouped data 
18. Registration is the source of: 
(a) primary data (b) secondary data 
(c) qualitative data (d) continuous data 
19. Data in the population census reports are; 
(a) ungrouped data (b) secondary data 
(c) primary data (d) arrayed data 
20. Issuing a national identity card is an example of: 
(a) sampling (b) _ statistic 
(c) census (d) registration 
21. Avvariable that assumes only some selected values in a range is called: 
(a) continuous variable (b) quantitative variable 
(c) discrete variable (d) qualitative variable 
22. A variable that assumes any value within a range is called: , 
(a) discrete variable (b) continuous variable 
(c) independent variable (d) dependent variable 
23. A variable that provides the basis for estimation is called: 
(a) dependent variable (b) independent variable 
(c) continuous variable (d) qualitative variable 
24, The variable that is being predicted or estimated is called: 
(a) dependent variable (b) independent variable 
(c) discrete variable (d) continuous variable 
25. Monthly rainfall in a city during the last ten years is an example of a: 
(a) discrete variable (b) continuous variable 
(c) qualitative variable (a) independent variable 
26. The proportion of females in a sample of 50 accounts officers is an example of a: 
(a) parameter (b) _ statistic 
(©) array (d) variable 
27. Number of family members in different families in a town is an example of a: 
(a) discrete variable (b) continuous variable 
(c) dependent variable (d) qualitative variable 
28. Colors of flowers is an example of: 
(a) quantitative variable (b) qualitative variable 
(c) skewed variable (d) symmetric variable 


29. If each measurement in a data sct falls into one and only one of a set of 
categories, the data set is called: 
(a) quantitative (b) qualitative 
(c) continuous (a) constant 
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30. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


38. 


89. 


40. 


41, 
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Any phenomenon which is not measurable is called: 


(a) variable (b) constant 

(c) parameter (d) attribute 

A constant can assume values: 

(a) zero (b) one 

(c) fixed (d) not fixed 

A value which does not change from one individual to another indiviaual is 
called: 

(a) variable (b) statistic 

(c) constant (d) array 

In the plural sense, statistics means: 

(a) numerical data (b) methods 

(c) population data (d) sample data 

In the singular sense, statistics means: 

(a) methods (b) numerical data 

(c) sample data (d) population data 
Weight of earth is a: 

(a) discrete variable. (b) qualitative variable. 
(c) continuous variable. (d) difficult to tell. 


Weight of students in a class marks is a: . 


continuous data. 
constant data. 


continuous variable 
constant 


secondary data. 
true data. 


secondary data. 
true data. 


Students divided into different groups according to their intelligence and 


(a) discrete data. (b) 
(c) qualitative data. (d) 
Life of a T.V. tube is a: 

(a) discrete variable (b) 
(c) qualitative variable (d) 
Questionnaire method is used in collecting: 
(a) primary data. (b) 
(c) published data. (d) 
Census returns are: 

(a) primary data. (b) 
(c) qualitative data. (d) 
gender will generate: 

(a) quantitative data. (b) 
(c) continuous data. (d) 
Svatisties are: 

(a) aggregate of facts and figures. —_(b) 
(c) always continuous. (d) 


qualitative data. 
constant 


always true. 
always qualitative 


51. 
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42. Statistical results are: 
(a) randomly true (b) always true 
(c) not true (d) true on average 
43. Statistics does not study: 
(a) constant (b) statistic 
(c) parameter (d) individual 
44. A statistical population may consist of: 
(a) finite number of values (b) infinite number of values 
(c) either of (a) and (b) (d) none of (a) and (b) 
45. The only continuous variable here is: 
(a) rain fall on different days in a city. 
(b) number of customers entering a store on different days. 
(c) number of flights landing on an airport on different days. 
(d) none of them 
46. Example of descriptive statistics is: 
(a) 70% people in Pakistan live in rural areas. 
é (b) 50% people are likely to vote in the national election. 
{ (c) 20% of the bulbs produced in a factory will be defective. 
“  (d) difficult to tell. 
& Example of inferential statistics is: 
\ (a) percentage of smokers in Pakistan. 
(b) percentage of skilled workers in a factory. 
‘(c) estimate of increase in prices in the next year. 
(d) none of the above 
48. Statistics are always: “ 
(a) exact. (b) estimated values. 
(c) constant. (d) population values, 
49. Statistics must be: i 
(a) comparable. (b) not comparable. 
(c) discrete in nature. (d) qualitative in nature. 
50. Given 6 quantities, X, through X,, the correct notation for adding quantities 3 
through 6 is; 
3 6 
(@@) =X, ) =X 
i=6 i=l 
N 6 
@© =X, @ ZX, 


ts 
o 


4 
Given: X, = 12, X,= 19, X, = 10, X,=7, =x equals: 
is 


(a) 36 (b) 48 
() 41 (d) 29 
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n 
52. The symbolic notation — 2N tells us to: 
i= 
(a) add all quantities from Y, through Y, 
(b) add all quantities from Y = 2 through Y, 


(c) add all quantities from Y = 2 through Y=n 
(d) add all quantities from Y, through Y,, 


n 
53. 2 ( Xj -A) equals: 
i= 


n n : 
(a) 2 XC-A) (b) 2X nA 
a 
() nX,-nA @ =Xx-A 
54. The figure 22.25 rounded to one decimal place is: 
(a) 22.3 (ob) 22.1 
(ce) 22.2 _(d) 22 
55. The figure 22.15 rounded to one decimal place is: 
(a) 22:2 (b) 22,1 
(ce) 22 . (d) 22.3 
56. The figure 22.26 rounded to one decimal place is: 
(a) 22.2 (b) 22.3 
@* 221), (@) 22 
57. The figure 22.24 rounded to one decimal place is: ' 
(a) 22.2 (b) 22.3 
(ce) 22.1 (d) 22 
58. How many methods are used for the collection of data? 
(a) 4 (b) 3 ©) 2 @ 1 


15. 


1 

9 (b) (©) | 16. 

| 17. (a) | 18. _(b) 20. (a) 23. (b) [ 24. (a) 
25. (b) | 26. (b) | 27. (a) | 28. @) 129. @ | 30. @ [31 © | 32. © 


33. (a) | 84. (a) | 35. (© | 36. () | 37.) | 38. (a) | 39. (a) | 40._(b) 
41 (a) | 42 (@)| 43 (a) {44 @©/4 @/46 @| wm [48 
52 (a) | 53 (b) | 54 © | v8 (a) | 56 &) 
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Qu. 
Q.2 

Q.3 

Q.4 

Q.5 

Q.6 

Q.7 

Q.8 

Q.9 

Q.10 
Q.11 
Q.12 
Q.13 
Q.14 
Q.15 
Q.16 
Q.17 
Q.18 
Q.19 
Q.20 
Q.21 
Q.22 
Q.23 
Q.24 
Q.25 
Q.26 
Q.27 
Q.28 


Q.29 
Q.30 
Q.31 


Ans: 


Q.32 
Ans: 


SHORT QUESTIONS 
What is meant by a population? 
Define population and write down any six examples of population. 
What is meant by a sample? 
Differentiate between population and sample. 
Define parameter and statistic. 
Differentiate between parameter and statistic. 
Discuss the meaning of statistics in the singular and the plural sense, 
Discuss the meaning and scope of statistics. 
Write down the main characteristics of statistics. 
Define statistics. 
Write down the definitions of statistics. 
Define statistics in plural sense. 
Distinguish between “statistics” and “statistic”. 
Differentiate between variable and constant. 
Distinguish between discrete variable and continuous variable. 
Explain the quantitative and qualitative data. 
Define qualitative variable, giving examples. 
Differentiate between quantitative data and qualitative data with examples. 
Name the branches of statistics. 
Differentiate between descriptive and inferential statistics. 
Explain the functions of statistics. 
Explain the importance of statistics in different fields. 
Name the sources of primary data. 
Discuss the questionnaire method used in the collection of primary data. 
Define secondary data. ; 
Write down the different sources of secondary data.. 
What is the difference between primary and secondary data? 
Distinguish between primary and secondary data. Describe some methods of 
collection of primary data. 
Write down the different types of investigations. 
Write down the different kinds of statistical investigations. 


4 
Given: X, = 8, X,= 1, X= 14, X= 2, X,=-10, Xy=4. Find E( Xi+ b), 


when b= 1. 
18 


5 
Given: X, = 2, X,=8, X,=—6, X,= 1, X,=0. Find E (Xia), when a=2, 


-5 


28 : Basic Statistics Part-I 


5 
Q.33 Given: X, = 8, X, = 1, X; = 14, X, = 2, X,=— 10, X= 4. Find E (Xj+ a), 
¥ 


when a= 2. tf 


Ans; 15 
4 
Q.34 Given: X, = 8, X,=-—6, X,=1. Find 26 Xj-a ), when a=0. 
Ans: 3 
5 

Q.35. If X,= 1, X;=0, find Be Xj—a), when a= 3. 

i= 
Ans: -5 

; 5: ae 5 q 5 2 
Q.36 Given 2k = 120 and Zhe 5. Find 26 Xj + 2Xj+1). 
i= i= i= 


Ans: 135 
5 5 
Q.37 Given Y,=a+bX, 2 X,= 100, a= 15 andb=2. Find EY, 
i= i= 


Ans: 275 
4 4 
@.38 Given X,=a+bY, ¥ Y\=20,a=5andb=—0.5. Find 2 X,. 
i= i= 
Ans: 10 
3 8 
Q.39 Expand the following summation signs: (a) 2,6 ny? +i) (@) 2,6 Yitp) 
i= Ss 
Ans. (a) Y{+Y3+Y3+6  (b) Ys+Yot¥r+Ys+ 4u 


Q.40 Given Df = 10, Df X = 20 and Lf X? = 100. Evaluate Lf ( X - 2)*. 
Ans. 60 
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EXERCISES 


Q.1 Determine whether each of the following variables is quantitative or 
qualitative. 


Ans: 


Q.2 


Ans: 


Q.3 


@) 
(ii) 
(iii) 
(iv) 
(v) 
(vi) 
(vii) 


Amount of money spent on books. 
Mode of payment. 

Gender 

Number of local calls made per month 
Colour of eyes 

Number of textbooks purchased 

The age of customers in a shop 


(viii) The classification of all employees in a college as principal, professors or 


(ix) 
(x) 
@) 
(v) 
(ix) 


clerks. 

Length of telephone calls recorded at a switchboard. 

Number of family members 

Quantitative (ii) Qualitative (iii) Qualitative (iv) Quantitative 
Qualitative (vi) Quantitative (vii) Quantitative (viii) Qualitative 
Quantitative (x) Quantitative 


Which of the following represents discrete data and which represents 
continuous data. 


@ 
(ii) 
(iii) 
(iv) 
(v) 
(vi) 
(vii) 


Number of houses in different cities. 

The yearly income of shopkeepers, 

The number of flowers on plants. 

The life time of electric bulbs. 

The length of rolls of cloth. 

The weights of students in a college. 

Temperature of a patient at different intervals of time. 


(viii) Monthly rainfall in a city during the last ten years.” 


(ix) 
(x) 
@ 

(vy) 
(ix) 


Total number of shares sold on different days in the stock market, 
Speed of an automobile. 

Discrete (i) Continuous (iii) Discrete (iv) Continuous 
Continuous (yi) Continuous (vii) Continuous (viii) Continuous 
Discrete (x) Continuous 


For the following statements, decide whether the variable is continuous or 
discrete. 


(i) 

(ii) 
iii) 
(iv) 


The number of hours for which a light bulb being functional. 

The number of current account balances checked by a firm of auditors 
each year, 

The number of rooms-with-bathroom in three-star Pakistani hotels. 
The number of failures per 100 hours of operation of a large computer 
system. 
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(v) The number of hours lost per 100 hours due to failures of a large 
computer system. 

(vi) The number of cars manufactured by a company each month. 

(vii) The annual rainfall in Islamic countries in 1994. 

(viii) The number of earthquakes per year in Japan in the period 1940 to 
1994. 

(ix) The output of Dhodak oil rigs in 1990. 

(x) The number of times rats turn right in 10 encounters with a T- 
junction in a maze. 

(xi) Number of road accidents on the G.T. road between Lahore and 
Rawalpindi. 

(xii) The weight of rice in kg. produced per acre. 

(xiii) The reaction time of rats to a stimulus. 

(xiv) The number of errors per page of a balance sheet. 

(xv) The yield of tomatoes per plant in a greenhouse. 

(xvi) The amount of milk produced by a cow during different months. 

(xvii) The number of T.V. licences issued by any branch of a bank each year. 

(xviii) The percentage of hydrogen content of gases collected from samples 
near a volcanic eruption. 

(xix) A finite number of values or a countable number of values. 

Ans:(i) Continuous (ii) Discrete (ii) Discrete (iv) Discrete 

(v) Discrete (vi) Discrete (vii) Continuous (viii) Discrete 

(ix) Discrete (x) Discrete (xi) Discrete (xii) Continuous 

(xiii) Continuous (xiv) Discrete (xv) Continuous (xvi) Continuous 

(xvii) Discrete (xviii) Continuous. (xix) Discrete 


Q.4 Identify the following quantitative variables as discrete or continuous. 


() The number of customers arriving between 10 and 11 A.M at bank. 
(ii) Area of an office. é 
(iii) The inventory of cars for sale at an automobile dealer. 


(ivy) Distance required for stopping an automobile travelling at 80 kilometers 


per hour. > 
(v) Price of a share in Karachi Stock Exchange. 
Ans:(i) Discrete (a) Continuous (iii) Discrete 
(iv) Continuous (v) Continuous 


Q.5 Write out each of the following summations: 


@ 


8 4 
pa Gi) ,E,(N-9)? GD) BC-AD 
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5 3 
(iv), 2, XP A ) (vi) E,CXj+¥j-3})? 


ime IM 
° 


n = 
(vii) pea Ye wilt) f {625i-2)* Gx), 2, §CX-XYP 


Ans: 

@ Xy+ Xs + Xo + Xp + Xp 

Gi) (¥,-3)? + (¥,-3)#+ (Y¥3-3)? + (¥,-3)? 

(iii) (X,-A)+(X,-A) +(X,-A) =X, + KX, +X,-3A 

(iv) X3 f+ Xo t+ xX? f 

(v) cto+ct+.. + c(Ntimes) = Ne 

(vi) CX, +X, Z1)8 + (Xa Yo— Za)? + (Xa Ya — Za)? 

(vii) Yi+y2 +y?2 +y? 

(viii) (8X,- 2)? + (3X,-2)? + (8X,;-2)? 

Gx) (Xi - XK) + (GX) + H(M—-K + + (XX! 
Q.6 Use the summation symbol to represent the following expressions: 

Gi) Xg+Xyt+ Ky +--+ Xoo 

Gi) eX, + eK, +eX3 + ++ + cXy 

(ii) £,X7 D] + X5 DZ + fX} Dj +--+ foXt) D%y 

(iv) f, (X,-a)? +f (X,—a)? + fy (X— a)? 

(vy) fiQavi) + f2(ue,ve) + fa(ug,vs) + + + fC, Vi) 


(vi) ay(X) +1) + ag(X_ +2) + ag (Xg+3) +--+ + ayo( Xi + 10) 


Ans: 
20 N 10 
@ 2% Gi) 2 cX (ii), Z| £7 D? 
i=8 t=1 t=1 
3 k 10 
Gv) 2 f(Xi-a) (vy) = f(y, vi) (i) Ba (Xi+i) 
i=1 i=1 t=1 
Q.7 Given ai=3, a:=5, as=8 and as= 2, evaluate 
@) Za Gi) Z( 6-a ) Gi) + 2( 4da-2) 
(iv) Za? (v) B( a-2 )? (Wi) 22(a-8) 
Ans: 


@ 18 Gi) 6 (iii) 64 (iv) 102 (v) 46 (vi) 12 
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Q.8 Find the value for each of the following expressions: 
6 5 4 
G) 2s St (ii) Fa (j?+3) Gi) LZ (3k-2) 
#=1 j= k=1 
5 i 3 6 
iv) ,E (+t) @) “PE (@+8e-2) wi) E (25-38) 
Ans: 
G) 63 Gi WO* (iii) 22 (iv) 68 (v) 24 (vi) -2 
6 


Q.9 Giventhat L Xi=14 and 5 X? 
i=1 


i=l 


6 
@) = ( Xi-1) (ii) y 
i=l i=l 


Ans: 


G) 8 ” 54 


Q1018 2 Xi=- 24 and z Noe 
i=l 


@) z (2Xi+ 10) 


1=1 
5 
@ii) EX (Xi- 4)? 
i=1 
Ans:(i) 2 (ii) 196 
4 
Q.11 Given 2) Xi= 
4 
(i) as (4Xi + 3Yi) 
4 
(ini) 2% Xi (Yi- 2) 
Ans: 
G@) 50 (ii) 195 


= 44, find 


6 
( 3Xi+2) ii) 2 (Xi+4 


(ii) 252 

124, Calculate 

5 
ii) = i(%i-38) 

5 
(iv) 2D (Xi+5)(%-5) 

i=l 

ii) 396 (iv) -1 


20, e Yi=- 10 and z Xi Yi= 15. Find 


i= 1 


4 

(ii) 2 (H-5) (BYi+ 6) 
res 
4 

(iv) x (Xi+2)(¥i-2) 
i=l 

(a) = 25 (iv) -61 


Q.12 Given that Y, is defined a3 IBM's sales in billions of dollars in year t. The sales 
of the IBM corporation in 1966 — 76 are shown below: 


Year (t) | 1966] 1967 
Sales (Yt) 53 
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Evaluate each of the foliowing expressions: 


1971 1970 1970 1973 
@ Zee ) = (M8) Gi) EZ (Y%-2)2 Gv) E oy? 
t= 1966 t= 1967 (= 1968 t= 1969 


Ans: 
G) 39.4 (ii) 149 Gi) 81.3 (iv) 776.46 
Q.13 Suppose that we have two series, X and Y, which are paired as follows: 


Compute: (i) (2X) (ZY) (ii) EX? - (EX)? (iii) DX?Y - EX? 
(iv) 5 (2X + 8Y) (vy) 2X+Y)(X-Y) (vi) rX2(Y+5) 
Ans: 
@ 1116 Gi) -1010 ii) 425 iv) 165 (vy) 65 (vi) 2141 
Q.14 Given: 


X,=5 X= 10 X,=12 X,=15 


Y,=8 


Calculate: () =X? Gi) SxXy ii) (XK - 72 
(iv) EX(Y-6) (v) say? (vi) 2(X-5)(Y-10) 
(vii) CX+¥Y)(X-Y) (viii) 5 oxy? 
Ans: 
(i) 543 (i) 756 (iii) 102 (iv) 462 
(v) 3171 (vi) 171 (vii) -514 (vili) 25196 
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2 
PRESENTATION OF DATA 


2.1. INTRODUCTION 

The number of observations is usually very large in the collected data. It is 
difficult to work on the observations which are called the raw data. Before we apply 
any statistical tool on the data the observations are put into some condensed form so 
that the statistical work becomes simple. This is called presentation of data. 
Presentation of data includes (i) Classification (ii) Tabulation (iii) Graphic 
Presentation. 

2.1.1. CLASSIFICATION 

The huge amount of data containing individual observations is divided into 
some classes. Each class contains a set of homogeneous observations. The process of 
arranging observations into homogeneous groups or classes is called classification. 
According to L.R. Connor "classification is the process of arrenging things (either 
actually or notionally) in groups or classes according to their resemblances and 
affinities”. 

The classification of the data mainly depends upon the nature, scope and 
purpose of the statistical inquiry. Some characteristics of a good classification are: 
(i) Classification should be unambiguous 

Different classes should be defined clearly. All the classes should be mutually 
exclusive and there should be no confusion about them. If the industrial workers are 
divided into two classes i.e. skilled and unskilled, it should be quite clear as to who 
is a skilled worker and who is unskilled. 

(ii) Classification should be stable 

Suppose we have to carry out a survey about smokers in our country, Who is a 
smoker? It must be defined clearly. Some people are casual smokers and some are 
regular. Definition of a smoker should not vary from place to place or from time to 
time, Thus classification of data should be stable so that the data collected from 
different places on different occasions are comparable. 

(iii) Classification should not be rigid 

Classification should be able to accommodate the new observations or to drop 
the observations which are no more required. 
2,1,2, TYPES OF CLASSIFICATION 

Classification depends upon the characteristics of the statistical data. There 
are two characteristics namely descriptive and numerical. The examples of 
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descriptive chamactéristics are friendship, gender, intelligence and poverty etc. 
Descriptive characteristics cannot be measured. When the data are classified on the 
basis of descriptive statistics, it is known as classification according to attributes or 
classification according to qualitative variable. When the data are classified on the 
basis of numerical characteristics, it is called classification according to class 
intervals or classification according to quantitative variable. 

2,2, CLASSIFICATION ACCORDING TO ATTRIBUTES 


There may be one, two or more attributes and the observations may be put into 
different classes depending upon the number of attributes, 


2.2.1. SIMPLE CLASSIFICATION 


There may be only a single attribute (characteristic) and the data may be 
divided into different classes on the basis of this attribute. Suppose we are 
interested to study the population for literates. Some people are literate and some 
are illiterate. Thus the attribute has two levels (sub-classes). This is called twofold 
division or dichotomy (cutting into two). The attribute may have three levels for 
example (i) highly educated (ii) average education (iii) not educated. When an 
attribute has three levels, it is called 3-fold division or trichotomy. More than three 
levels are also possible. For example different income levels of the people may be 
(i) very rich (ii) rich (iii) average income (iv) low income. The attribute of income has 
four levels (sub-classes). When an attribute has more than 3 levels it is called 
manifold-division. Thus the number of classes based on a single attribute may be 
two or more than two, As long as there is one attribute, the classification is called 
one-way or simple. 

2.2.2. TWO-WAY CLASSIFICATION 

When the data are divided into different classes on the basis of two attributes, 
it is called two - way classification. In two-way classification there are at least 4 
classes. Suppose one attribute is income with two levels-rich and poor and the other 
is education with two levels-literate and illiterate, then the data are divided into 
2x 2=4 classes given below: 


(i) rich and literate (ii) rich and illiterate 
(ili) poor and literate (iv) poor and illiterate 


If one attribute has 3 levels and the other has 2 levels, then there are 3 x 2=6 
classes of the data. If the attribute of income has 3 levels-rich, average income and 
poor and the other attribute education has 2 levels-literate and illiterate, then the 
data are divided into 6 classes as below: 


Gi) rich and literate (ii) rich and illiterate 
(iii) average income and literate (iv) average income and illiterate 
(v) poor and literate (vi) poor and illiterate. 
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If both attributes have 3 levels, then the data are divided into 3 x 3 = 9 classes. 
As long as there are two attributes, the classification is two-way, When their are 
more than two attributes the classification is called manifold. 

) 2.8. QUANTITATIVE DATA 

When the observations are generated by counting or by measurement, the data 
obtained are called quantitative. There are three different methods of presenting the 
quantitative data. 

(i) Series of individual observations. (ii) Discrete frequency distribution. 

(iii) Frequency distribution (Grouped Data) or Classification according to 

class-intervals, 

2.8.1. SERIES OF INDIVIDUAL OBSERVATIONS 

When the data are collected, these are in the form of individual observations, 
These are also called raw data, When raw numerical data are arranged in ascending 
or descending order of magnitude, it is called an array or arrayed data, An array of 
observations is called series of individual observations, Suppose we have asked 10 
students about their marks in a test in English. Their responses are recorded below 
which are called raw data. 


R. No. of students 


2 3 4 5 6 9 10 

Piarerin Basia [30 [ao [30 bao [oo | ao | so [50 [ao [a0 

Raw data when arranged in ascending order become arrayed data or series of 
individual observations. The array is 30, 30, 40, 40, 40, 40, 40, 50, 50, 50. 
2.3.2. DISCRETE FREQUENCY DISTRIBUTION 

When the variable is discrete, the observations obtained are called discrete 
data. Discrete data is generated by counting, therefore each and every observation is 
exact. When an observation is repeated, it is counted. The 'number' for which the 
observation is repeated is called 'frequency' of that observation. The symbol ‘f is 
used for frequency. The symbol X is commonly used for observations. Suppose we 
have raw data as below which are the numbers of students in ten classes of a school. 


[so | s2 [ 30 [ 40 [ 42 [ 40 [ 32 [ 40 T 2 [ 40 | 
There are 2 sections of 30 students each, 3 sections have 32 students, 4 sections 
have 40 students and 1 section has 42 students. This information can be collected in 
the form of table below called discrete frequency distribution. 


Diserete Frequency Distribution of Number of Students 


Number of students Number of sections 
xX f 


30 


32 
40 
42 
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This method of condensation of observations into discrete frequency 
distribution is for discrete data but it can also be used for continuous data as well. In 
actual practice this method is not used when the number of observations is very 
large. For very large number of observations, the observations are put into different 
groups. The grouping can be done for discrete as well as continuous data. 

The symbol 'X' is used for the variable which is recorded initially in the form of 
observations and the symbol 'f is used for the number of times each observation is 
repeated. , 

Example 2.1. 
The number of persons in 20 families in a small village are: 


l2;i{3j2}afe][s{s[sfs[s]a]s[s]als[s[s[4]5] 


Put the observations in discrete frequency distribution form. 


Solution: 
x f 


2.4, FREQUENCY DISTRIBUTION (GROUPED DATA) 

The observations on a continuous variable are not exact. These are rather the 
approximate measurements. We know from the theory of continuous variables that 
an observation (measurement) does not repeat itself as far as theory is concerned. In 
actual practice, the observations are repeated. These measurements are put into 
different classes, each class accommodates a certain number of observations called 
the frequency (f) of the class. The condensation of data into suitable number of 
classes, each class containing the observations which lie within the class, is called a 
frequency distribution or grouped data. The grouped data are also called 
classification according to class intervals. This is very popular way of presenting the 
statistical observations. The observations get condensed into a form convenient for 
further calculations. 

In actual practice the grouped data is used for continuous as well as discrete 
data. For illustration of various terms used in grouped data, we take the weights of 
40 students in kilograms (recorded to the nearest kilogram) and put them in 
grouped data as shown in Table 2,1, The weights are: 


[56 |55|59| 58] 59] 63 [63 [61 [62 [61 [62 [62 [6s | 4] 64] 65 [66 
63] 61] 69]61]58] 69] 65 |64[55 | 62] 63 [4 [59 [62] 5662 [6a 67 66] 65 


[Chapter 2] Presentation of Data 39 


Table 2.1. Entry Table 
Weight é No. of students 
4 


56, 55, 55, 56 
59, 58, 59, 58, 59, 59 
63, 63, 62, 61, 62, 62, 61 
62, 63, 62, 61, 62, 61, 63 
65, 64, 64, 65, 66, 64 
65, 64, 64, 65, 66, 64 
67, 69, 69, 67 


The various terms which are to be explained in connection with the grouped 
data are 
(i) number of classes (ii) class limits and class boundaries 
(iii) size of the interval (class-interval) (iv) class-marks or mid points 
(v) frequency 
2.4.1. NUMBER OF CLASSES ‘ 

There is no hard and fast rule about the number of classes. It is decided on the 
basis of the given data. Clearly, if the observations are spread over a long range of 
the variable, we shall make large number of classes. Keeping in view the relative 
importance of the observations, the observations of large magnitude and small 
magnitude are not put in the same class. If we take a very small number of classes, 
we shall loose much of the information contained in the raw data. In the data about 
heights of students, if the height 60" is combined with the height 72", the 
calculations on the grouped data will differ to a great extent from the calculations on 
the original observations. If we take very large number of classes, it will spoil the 
purpose of classification. Thus we have to make a wise decision about the number of 
classes, The given observations, their nature, their size and their number give us 
some hint to decide the number of classes. Roughly, it can be suggested that the 
number of classes should be between 8 to 15. More than 15 classes may not increase 
the accuracy in calculations and less than 8 classes will mean that we are combining 
the large observations with the small observations with the result that their 
originality will be lost. 

In Table 2.1, there are only 5 classes which are 5b — 57, 58 — 60, 61 — 63, 64-66, 
and 67 - 69. 

The students are advised to make more than 5 classes when they are given 
some data for classification. 

The starting class is 55 — 57. This class will accommodate the observations 
equal to 55, 56, and 57. The classes like 55 — 57, 58 — 60, ... are suitable for discrete 
set of data. But it will not be wrong to use this type of classes for continuous data. 
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For continuous data, the classes are taken as 55 — 57, 57 — 59, 59 — 61, .... Now 57 is 
present in the starting class as well as in the second class. When this is the case, the 
observation 57 is to be recorded in the class 57 — 59 and not in the class 55 — 57. 
Thus if the end point of a class and the starting point of the next class are equal, the 
‘next class' will absorb the observations equal to these end points. 


The word inclusive classification is used for the classes like 55 — 57, 58 — 60, ..., 
67 — 69. Both end points of the class are included in the respective class. 

For the classes like 55 — 57, 57 — 59, ..., the word exclusive classification is 
used. It is exclusive because the observation 57 is excluded from the class 55 — 57. A 
class may be "less than 10" or "more than 90", Such classes are called open ended 
classes. One end of such a class is not specified. Such classes should be avoided 
because there are certain important calculations (for example the mean, the 
geometric mean, the harmonic mean, the mean deviation and standard deviation) 
which cannot be made when the classified data contain some open end classes. 

Some authors use the words 'class' and 'class interval’ interchangeably. We 
have not used the word ‘class interval’ for a 'class’. 

2.4.2 CLASS LIMITS AND CLASS BOUNDARIES 
The class 55 — 57 starts with 55 and ends at 57. The starting point 55 is called 
* lower class limit and the end point 57 is called upper class limit. The class limits 
should be taken such that it is clear as to which observation should be included in 
that class. In continuous data when observations are recorded in whole numbers, the 
obseryation 55, in fact, is a value between 54.5 and 55.5. An observation which is 
54.6 is recorded as 55. Thus 55 starts from 54.5 which is called the lower class 
boundary of the group 55 — 57. The upper limit of the group 55 — 57 is 57. The 
observation 57 covers all observations on a continuous variable up to 57.5. Thus 57.5 
is the actual upper boundary of the class 55 — 57. The class 55 — 57 includes all those 
observations which fall in the expanded interval 54.5 — 57.5. The values 54.5 and 
57.5 are the lower and upper class boundaries of the class 55 — 57. A class boundary 
can be calculated by finding the mean of the upper limit of a class and the lower 
dimit of the next higher class. If the original observations are recorded as whole 
numbers, the class boundaries are given up to one decimal place. It means no 
observation will be equal to any class boundary. Thus there will be no confusion 
whether or not an observation should be included in a ..ass. If the actual 
observations are measured up to one decimal place, the class boundaries will be up 
to two decimal places and soon. The data of Table 2.1. are reproduced here in Table 
2y2. with class boundaries calculated. 
Table 2.2. Class boundaries 


Weight — Class boundaries No. of students 
(kilogram) if 


54.5 —57.5 
57.5 — 60.5 


60.5 — 63.5 
63.5 — 66.5 
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Suppose we have classes as 1.0 - 1.4, 1.5 - 1.9, 2.0 - 2.4 ... with some 
frequencies, the class boundaries are calculated as below: 


0.95 — 1.45 
1.45 - 1.95 


1.95 - 2.45 


The first gxoupihas the lower class boundary equal to 0.95 which is the average 
of 1 and 0.9, wheré.0.9 is the upper limit of a group '0.5 — 0.9' which has not been 
written in the above table. Upper class boundary of the first group is 1.45 which is 
the average of 1.4 and 1.6. The value 1.45 is also the lower class boundary of the 
second group. Similarly 1.95 is the average of 1.9 and 2 where 2 is the lower limit of 
the next class. 

We have to calculate the class boundaries of the grouped data when we have to 

(i) Calculate median, quartiles, deciles and percentiles of the data. 

(Gi) Calculate mode. 

Gii) Make graphs of frequency distributions. 

For the classes 6 — 10, 10 — 15, 15 — 20, ..., calculation of class boundaries is 
not required because the given lower and upper class limits are already the class 
boundaries. For the open ended class "less than 10", the lower class boundary cannot 
be calculated. Similarly for the open ended class "more than 90" the upper class 
boundary cannot be calculated. 

2,4.3. SIZE OF THE INTERVAL ( CLASS INTERVAL ) 

The difference between the upper class boundary and the lower class boundary 
of a class is called the ‘interval’ or 'class interval’ of the class. It is also called the size 
of the interval, width or length of the interval. In Table 2.2, the class boundaries of 
the starting group are 54.5 and 57.5. Thus the interval of this class is 57.5-54.5 = 3, 
Interval for a class can also be calculated by finding the difference between the two 
successive lower class limits or between two successive upper class limits or between 
the two successive mid points. The difference will be the interval for the preceding of 
the two classes. All classes may have the same interval which is called the uniform 
interval. A uniform interval is helpful in statistical calculations. In some special 
cases, interval of unequal length can also be used. Uniform interval can be denoted 
by some symbol 'h' or ‘i’. 


Warning 


Table 2.1. has the classes 55 — 57, 58—60, .... This is an inclusive classification. 
In these classes length of the interval calculated by finding the difference between 
two class limits is wrong. The difference between 57 and 55 is 57 — 55 = 2, whereas 
the interval actually is 57.5 — 54.5 = 3. Thus an interval is the difference betwren 
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two class boundaries and not the difference between two class limits. It is always 
safe to find the difference between two successive lower class limits (or lower class 
boundaries) for finding interval of the preceding of the two classes. When the classes 
are 10 — 15, 15 — 20, we may find the difference between two class limits. Here the 
difference between 15 and 10 is 5 which is the interval of the class 10 — 15, If 
possible the size of the interval should be 1, 2, 5, 10 or any multiple of these digits or 
it should be 0.01, 0.1, 0.2, 0.5. We shall avoid the intervals like 5.1, 2.3, 10.4 etc. We 
prefer the whole numbers as intervals. 
2.4.4, CLASS MARKS OR MIDPOINTS 

For a certain class, the class mark or midpoint is calculated by finding the 
mean of the two class limits or the mean of the two class boundaries. It makes no 
difference whether we calculate it from the two class limits or the two class 
boundaries, In Table 2.1, we have the classes with class limits ie., 55 — 57, 58 — 60, 

: Artes 55 +57 58 +60 

««. Their midpoints are 2 756 and 2  ~ 959. In Table 2.2, we have the 


classes with class boundaries 54.5 — 57.5. 57.5 — 60.5, .... The midpoints are 


4.5 + 57. hs § 
BAS +526 = 56 and S75 +605 = 59 which are same as calculated from the class 


limits. The class mark of a class represents all the observations of that class. It is a 
representative figure of the observations of its class. If it is the average or close to 
the average of the observations lying in that class, then the calculations on the 
grouped data will be close to the calculations on the actual raw data. 

2.4.5, FREQUENCY 

Frequency of a class is the number of observations following in that class. 
There are two methods of recording the observations against the respective classes. 
These methods are: 

@ _ By listing the actual observations (entry table) 

(ii) By using a tally-column (tally sheet) 

In table 2.1. where the observations have been listed against their respective 
classes the first method is used. The observations against each lass are counted and 
this count is recorded as frequency (f) of each class in a colur'n having heading as 
frequency. In this table the frequencies of the classes are 4, 6, 14, 12, 4 under the 
heading 'number of students’. 

Jn Tally Sheet method, we put a stroke or a mark '/' or vertical line or bar as 
'I' for each observation against the respective class under the column headed Tally 
Bars, Four marks or strokes like //// show that 4 observations have gone toa 
class. The 5th observation is recorded by crossing these 4 strokes to get 
¥¥*# which stands for a group of 5 observations. When all the observations have 
been marked, we write the total of the strokes corresponding to each class in the 
column headed by frequency. Groups of five are convenient for counting. The 
observations in table 2.1. are presented in table 2.3. with the help of tally sheet. The 
class boundaries and the midpoints are ~'so given. 
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Table 2.3. Tally Sheet 
Frequency Distribution of Weights of 40 Students 


Weights Class Frequency 
Re en ee 


54.5 - 57.5 
57.5 — 60.5 
60.5 — 63.5 
63.5 — 66.5 
66.5 — 69.5 


2.4.6. FORMING A FREQUENCY DISTRIBUTION 


Grouped data are used for presentation of continuous data but if the discrete 
data are very large, they are treated as continuous for the sake of convenience of 
calculations. Hence the discrete data can be grouped in the same way as the 
continuous data and rest of the calculations are similar for both types of data. For 
making a frequency distribution we adopt the following procedure: 

(@ First of all we find the range of the data. The range is the difference between 
the maximum and the minimum observation. 

(ii) We decide the approximate number of classes into Which the data are to be 
grouped. Suppose the range is 65 and we want to make about 12 classes. The 
range is divided by the number of classes. We get 65/12 = 5.4, which is the 
approximate size of the interval of the classes. We usually avoid intervals like 
5.4. We can use interval of 5 or 6. If we take 5 as interval, number of classes 
will increase, Thus if interval is 5, the number of classes will be 65/5 = 13. If we 
take 6 as length of interval, the number of classes will be less than 12 because 
65/6 = 10.8 which means 11 classes. Thus we can take a start with approximate 
number of classes and then we can adjust the interval so that it is preferably 
an integer like 1, 2, 3, 4, 5, 10, 20, 25 or 50 or any multiple of these integers. If 
the interval is less than 1, we shall take interval equal to 0.01, 0.05, 0.1, 0.2, 
0.4, 0.5 or 0.8. We can also take a start by taking a reasonable size of the 
interval and then the number of classes can be determined. 

(iii) When the number of classes and the interval has been decided, then we write 
the class limits, The lowest class usually starts with a number which is a 
multiple of the interval. The class limits should be written so that the smallest 
observation is absorbed in the starting class and the largest observation is 
absorbed in the last group. The open ended classes should be avoided. 

(iv) All the observations are put into respective classes. This may be done by using 
‘tally column' method which is suitable for tabulating the observations into 
respective classes. The totul of the tally bars is calculated to get the frequency 
against each class. The frequencies of all classes are noted to get grouped data 
or frequency distribution of the given variable. The total of the frequency 
column must be equal to the number of observations. 
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Example 2.2. 
Arrange the following marks in a frequency table taking a suitable size of 
interval by: (i) Entry table (ii) Tally sheet. 
[13] 81 [58 [81] 35] 
|72| 84] 91] 87] 76] 43] 83] 40| 73 | 86 [73] 
| 29 | 43] 85] 42/80] 75] 85/62] 57] 64[ 70, 


L73 | [34 
Solution: 


Here, highest observation = 96, lowest observation = 13, Range = 96 — 13 = 83. 
If we take an interval of 10, we shall have 9 classes which is a reasonable number. 
The minimum observation is 13. Let us take the lowest class as 10 — 20. The 
frequency distribution is obtained as below: 
i) By Entry Table 


13 
29, 22 

33, 36, 33, 30, 34 

43, 40, 43, 43, 42, 49, 47 


58, 59, 58, 57, 57, 57, 50, 55, 57, 53 10 
61, 67, 65, 62, 62, 61, 65, 62, 64, 64, 69, 69, 63, 69 14 
75, 70, 75, 72, 76, 73, 73, 76, 73, 77, 72, 72, 75, 70, 20 
74, 78, 73, 73, 78, 70 

81, 81, 85, 84, 81, 87, 84, 87, 83, 86, 85, 80, 85, 15 
80, 85 


91, 95, 95, 92, 96, 91 


VATS SS 


VAS AAS 10 
VIIA AISA ASS 4 
a 20 
i 15 


AAAS S 
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If frequency of a class is divided by the sum of frequencies we get what is called 
a relative frequency. If we calculate the relative frequencies for all the classes, we get 
the relative frequency distribution. The total of the relative frequencies is equal to 1. 
If the frequencies in Table 2.1. are converted into relative frequencies, we get relative 
frequency table or relative frequency distribution. Table 2.4. gives the relative 
frequency distribution of weights of 40 students. 


Table 2.4. 
Relative Frequency Distribution of Weights of 40 students 
a A pe 
Weights | Frequency Relative frequencies 
(kilograms ) 
4 
56 — 57 4 40 = 0.10 
58-60 | 6 SL 0.15 
40 Ms 
14 
61-63 14 40 = 0:35 
12 
64-66 12 105 0.30 
67 4 sete 10 
-69 40 7 0. 
Total 40 | 1,00 


The relative frequencies are also called proportions and in discussion on 
probability we shall call them probabilities of the classes. The idea of relative 
frequencies is helpful in understanding the basic lessons on probability. It is also 
used in the normal distribution and other probability distributions where the total 
area under the curve is unity. 


2.4.8. PERCENTAGE FREQUENCY DISTRIBUTION 

If a relative frequency is multiplied by 100, we get percentage relative 
frequency. If all the relative frequencies are converted into percentage relative 
frequencies, we get percentage relative frequency distribution or simply percentage 
frequency distribution. The relative frequencies of Table 2.4, are converted into 
percentage relative frequencies which are given in Table 2.5. 
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Table 2.5. 
Percentage Frequency Distribution of Weights of 40 students 
Weights Frequency Percentage frequencies 
(kilograms ) 
Fee SS Sr ee) ee 
55-57 4 x 100 = 10 % 
6 
58-60 6 J * 100 = 15 % 
61-63 14 14 100 = 35 % 
40 
12 
64 - 66 12 Fo * 100 = 30 % 
40 
67-69 4 | 4» 100 = 10 % 
Total 40 100 % 


2.4.9. CUMULATIVE FREQUENCY DISTRIBUTION 


For cumulative frequency distribution, the class limits are converted into class 
boundaries. Cumulative frequency of a class is the total of all frequencies up to that 
class, Thus in Table 2.5. cumulative frequency of the class 57.5 — 60.5 is 4+ 6 = 10 
and the cumulative frequency of the class 60.5 — 63.5 is 4+.6+14= 24, This means 
that there are 10 observations less than 60.5 and there are 24 observations less than 
63.5. These are called ‘less than’ cumulative frequencies. If we calculate the 
cumulative frequencies from the bottom, we get what are called "more than" 
cumulative frequencies. Thus there are 4 observations more than 66.5, there are 
4+ 12 = 16 observations more than 63.5 and there are 4 + 12 + 14 = 30 observations 
more than 60.5. The “less than" and "more than" cumulative frequencies of Table 
2.1. are given in Table 2.6(a) and Table 2.6(b) respectively. 

Table 2.6(a) 
Less than Cumulative frequency distribution of weights of 40 students 


Weights Frequency | Class boundaries Weights ‘less than' 
‘less than’ | cumulative 
frequency 
54.5 —57.5 less than 57.5 4 
57.5 - 60.5 | lessthan60.5 | 4+6 = 10 
60.5 — 63.5 | less than 63.5 | 10+ 14 = 24 
63.5 — 66.5 | less than 66.5 | 24+12 = 36 
36+4 = 40 


66.5 - 69.5 | less than 69.5 
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Table 2.6(b) 
‘More than' cumulative frequency distribution of weights of 40 students 


|" Weights | Frequency | Class boundaries | Weights | ‘more than’ | 
‘more than' cumulative 
frequency 
65-57 4 64.5 - 67.5 more than 64.6 | 86+4 = 40 
58-60 6 57.5 - 60.5 more than 57.5 | 30+6 = 36 
61-63 14 60.5 - 63.5 more than 60.5 | 16+ 14 = 30 
64 — 66 12 63.5 — 66.5 more than 63.5 | 4+12 = 16 
67-69 4 66.5 — 69.5 more than 66.5 4 


Ute more than 66.5) 

When it is not mentioned ‘less than! or 'more than' cumulative frequency then 
it always means the ‘less than’ cumulative frequency. The cumulative frequency 
distribution gives us information about the number of observations which are less 
than or more than some particular point. 

The percentage frequency distribution can also be converted into cumulative 
frequency distribution. It is called cumulative percentage frequency distribution. 
Percentage frequency distribution of Table 2.5. converted into cumulative 
percentage frequency distribution is given in Table 2.7. 


Table 2.7 
Cumulative percentage frequency distribution of weights of 40 students 
Weights | Frequency Class Percentage Weight Cumulative 
boundaries frequency ‘less than' | percentage 
frequency 
55-57 4 54.5 - 57.5 4 * 100 = 10 % | less than 57.5 10% 
58 — 60 6 57,5 - 60.5 < * 100 = 15 % | less than 60.5 25% 
14 : 
61-63 14 60.5 ~ 63.5 40 * 100= 35 % | less than 63.5 60 % 
* 12 
64-66 12 63.5 — 66.5 40” 100 = 30 % | less than 66.5 90 % 
67-69 4 66.5 - 69.5 4 * 100 = 10 % | less than 69.5 100 % 
Total 40 - 100 % = - 
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It may be mentioned here that the cumulative frequencies of Table 2.6(a) can 
also be converted into percentage frequencies. Each cumulative frequency of Table 
2.6(a) is divided by 40 and multiplied by 100 to get the result given in the last 
column of Table 2.7. 

Thus it makes no difference whether the frequencies are first converted into 
percentage frequencies and then cumulated (as is done in Table 2.7) or the 
cumulative frequencies are calculated first and are then converted into percentage 
frequencies. The students are advised to convert the cumulative frequencies of Table 
2.6(a) into cumulative percentage frequencies. 

2.4.10, BI-VARIATE FREQUENCY DISTRIBUTION 

We have discussed the frequency distributions of a single variable. Such 
frequency distributions are called univariate which means a single variable. 
Sometimes we have observations in pairs and each pair is obtained as a result of 
observations on two variables. For example the marks of mathematics and statistics 
for a student is a single pair of observations. The marks of another student in two 
subjects will make another pair of observations. Suppose we have the ages of 65 
husbands and their wives given below:- 


pevdvemt| unl lata =) a aaa 
is ofvves [20 [8 | [| [oe | 
SIENNA 


Ages of husbands | 29 | 29 | 25 | 25 


fas] 2 0 | FI} 
heen] 25 | [20 [aa] 12] [27 se 
hewetvien [so] [oo [oa [0] 2 [ao [0 


The ages of husbands and wives are put into 5 groups with classes as 15 — 18, 
19 — 22, 23 — 26, 27 — 30 and 31 — 34. The ages of husbands are given in the columns 
and the ages of wives in rows. An observation (16, 20) means the age of husband is 
16 and the age of his wife is ZC. This observation is recorded under the first column 
'15 — 18' and against the second row 19 — 22. At the intersection of 1st column and 
2nd row the observation is recorded in a cell by a stroke '1'. All the observations are 
assigned to respective cells and then we add the strokes in rows and columns to get 
frequencies in rows and columns. The observations classified according to this 
method are shown in Table 2.8. 
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Table 2.8. 
Ages of husbands 
Classes | 15-18 | 19-22 | 23-26 ] 27-80 | 31-84 ] Total 


i 
i] | | 


HH HH IIL 
Ml 


The bivariate frequency distribution obtained from Table 2.8 is given in Table 2.8(a). 


Table 2.8(a) 
Bivariate frequency distribution of ages of husbands and wives. 
Ages of husbands 
Classes | 15-18 

15-18 3 2 4 _ _ 
Ages 19-22 3 8 3 1 apt pel 
of | 23-26 1 3 We} 9 — | 
Wives | 27-30 _— — 3 snglda ce kS) | 1 | 

Total Talk dpe BES: 24 20 10 4] 654 


2.5, TABULATION 

After collection of raw data the data are divided into different classes on the 
basis of the characteristics under study. There may be one or more characteristics 
under study. The classified data, whether it be qualitative or quantitative is 
arranged in different rows and columns. This orderly arrangement of data in 
columns and rows is called Tabulation. It is the presentation of data in a manner 
which is suitable for further statistical work and statistical record. For publication, 
the huge mass of data are presented in the form of suitable tables. Tremendous 
amount of data regarding population of Pakistan are presented’ in the form of 
different tables. For such data, it is not possible to do condensation so that these can 
be presented in a single table. Sometimes a large number of tables are required to 
highlight different aspects of the data. The size and number of tables also depend 
upon the volume of data and the number of variables and attributes being studied. 
Tabulation is a stage where data are ready for reading, for quick understanding, for 
publication and for further statistical work. 
2.5.1. TYPES OF TABULATION 

Sometimes the data are stored in the books, magazines and other publications 
for future reference, These tables are usually large in size and are called general 
purpose tables. Some tables contain the data for analysis and research. These tables 
are usually small in size and are called specific purpose tables. 
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When the qualitative or quantitative raw data are classified according to one 
characteristic or one variable, the tabulation of different groups is called single or 
one-way. The tabulation is two-way or double when there are two characteristics or 
two variables. Similarly the tabulation may be manifold or complex when the data 
are divided into different categories on the basis of more than two criteria. 
2.5.2. ONE-WAY TABULATION 

When the data are tabulated into different categories according to one 
characteristic, the tabulation is called simple or one-way. It is good time to mention 
a common mistake about one-way tabulation, About one-way tabulation, the 
impression is that this tabulation has only two groups of data like intelligent and 
non-intelligent, smokers and non-smokers, literate and illiterate. The data having 
two groups is though one way tabulation but more categories under the same 
characteristic are also pdssible and the table will still be simple or one way. In Table 
2.8 the qualitative data are given in one way tabulation. There are 1000 workers 
working in a factory. They are divided into two groups, skilled and unskilled. 

Table 2.8. One-Way Table 


| No. of skilled workers ] No. of unskilled workers | Total 
a 600 | 400 | 1000 
The workers may be divided into more than two categories. Table 2.9 is 
another example of one-way table. 
Table 2.9. One-Way Table 
No. of skilled No. of semi-skilled | No. of unskilled Total 


workers | workers workers 
600 | 500 200 1300 
In Table 2.10, the quantitative data are given in a one-way table. 
Table 2.10. One-Way Table 


Wages of workers No. of workers 


— 


2000 — 2100 70 

2100 — 2200 120 
2200 — 2300 250 
2300 — 2400 280 
2400 — 2500 200 


2500 and above 


Tables 2.8. and 2.9. are about a single characteristic that is skill of the job and 
table 2.10. is about a single variable ‘income’. All these three tables provide answer 
to a single question. Is the worker skilled? Answer is yes or no. What is the income 
of the worker? Answer is any one income group. Table 2.10. does not show whether a 
worker is skilled or not. 
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2.5.8. TWO-WAY TABULATION 


When the data are classified according to two criteria, the tabulation is called 
two-way or double-tabulation. Table 2.11, shows two-way table on the data about the 
workers in a factory. 
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Table 2.11. Two-Way Table 


2000 — 2100 


Wages of workers 


No. of skilled 
workers 


No. of unskilled 
workers 


2100 — 2200 120 
2200 — 2300 250 
2300 — 2400 280 
2400 — 2500 200 
2500 and over 80 


Two questions about a worker can be answered from the two-way table. One 
question about professional skill and the other about the income of the worker. 


2.5.4. THREE-WAY TABULATION 


When there are three variables or characteristics, the data is put into three- 
way tabulation Table 2.12 gives the data of factory workers in three-way table 
according to their wages, their professional training and their religion. 
Table 2.12. Three - Way Table 


Wages of workers skill wise and religion wise 


Number of skilled workers Number of unskilled workers Grand 
Non-Muslims Muslims | Non-Muslims Total 
2000 ~ 2100 30 70 
2100 - 2200 40 120 
2200 - 2300 130 100 250 
12300 - 2400 140 130 280 
2400 - 2500 200 
2500 and over 80 
Sub-Total 
1000 
Grand Total 


A three-way table can provide answers to three different questions regarding 
the observations with data. For example the three questions may be about income, 


religion and skill.’ 
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Main parts of a statistical table are the title, the box-head, the stub, the body, 
prefatory note, footnote and the source note. 


Let us sketch here Table 2.13 without figures to explain various sections of the 
table. 
Table 2.13. 
Wages of workers «———— Title 
Prefatory note ___ Box-head 


L 
No. of skilled workers No. of unskilled workers Grand 
Muslims 


Wages 


Stub—> 


Foot note. 

Source note. 

(i) Title ‘ 

A clear-cut heading of the table is called its title. It is said that the title should 
answer the questions like what, where, how classified and when about the data. 
Main title should be in capital letters. Different parts of the heading can be 
separated by commas but no full-stop should be used in the title. 

(ii) Box-head 

The headings and sub-headings in columns are called column captions. The 
space where these captions (column headings) are written is called box-head. The 
captions should be clearly written in prominent letters. First column should contain 
more important characteristic. Only the first word in each column should be in 
capital letters. 

(iii) Stub 

The heading and sub-heading of the rows are called row captions and the space 
where these headings are written is called stub. The stub should contain clear 
headings. 

(iv) Body of data 

This is tt main part of the table. Below the column headings and against 
the row marking, we write the figures. It is the main part of the table and contains 
the information regarding different groups. 
(vy) Prefatory Note and Footnote 

Sometimes the additional notes are given on the table to explain various points 
which are otherwise not clear. Prefatory note is given below the title to indicate as to 
which units have been included and which have been ignored. This note is written 
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below the title and above the box head. Footnote is used to clarify anything in the 
table. A footnote may be about columns, rows or the body. Footnote is at the end of 
the table. " 


(vi) Source Note 
The source note is given at the end of the table. It includes the information 
about compiling agency, publication, date of publication etc. 
2.5.6. GENERAL RULES OF TABULATION 
‘There are certain rules which should be followed in tabulation. 
(i) A table should be simple, easy to understand. There should be no need to go 
through footnotes or explanations. 
(ii) If the observations are large in number they can be broken into different 
groups and more than one table can be prepared. 
(iii) Proper and clear headings for columns and rows should be used. 
(iv) Thick lines should be used to seperate the data under big classes and thin lines 
to seperate the sub-classes of data. 
(v) The units of measurement under each heading and sub-heading must be 
indicated. 
(vi) Totals of columns should be preferably at the bottom of the table and the totals 
of rows should be at the extreme right side of the table. 
It is not possible to make rigid rules for tabulation. In general tables should 
suit the needs and requirements of an investigation. 
2.6. GRAPHIC REPRESENTATION 
Statistical tables contain data in the form of figures. But numerical figures are 
usually not attractive and some people find it difficult to get a clear picture from the 
numerical data. A more attragtive method of presenting the data is to make good 
looking diagrams and graphs, There are very large number of graphs, pictures and 
diagrams which are used to represent data. Good looking diagrams and pictures 
leave an everlasting impression on the mind of the observer. The modern advertising 
is mainly based on pictures of various types to attract the consumers. 
2.6.1. DIAGRAMS 
There are different types of diagrams used in representing data. We shall 
discuss here only the bar diagrams, rectangles and pie diagram. 
2.6.2. SIMPLE BAR DIAGRAMS 
Simple bar diagrams are made to represent geographical, historical, numerical 
and the qualitative data, The vertical or horizontal bars are made to represent the 
data when the difference between different quantities is not very large, The diffcrent 
quantities may be arranged in ascending or descending order but the time series 
data are not arranged. The height of the bars is in proportion to the size of the 
quantities, All the bars are of uniform width. The space between the bars should not 
be more than the width of the bars. The bars should be neither very tall nor they 
should look small statured. Very thick or very thin bars will not look attractive. 
Vertical bars are used for time series and quantitative data and horizontal bars are 
used for geographical data. 
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Example 2.3. 
Draw simple bar diagram to represent the profits of a bank for 5 years. 


[Prot (aionmay [10 [a | | as [a] 


Simple bar diagram showing the profits of a bank for 5 years 
Profit 


Fig.2.1. 


1989 1990 199] 1992 1993 
Years 

2.6.3. MULTIPLE BAR DIAGRAMS 

Multiple bar diagrams are made when the data are in the form of different 
groups and each group contains two or more pieces of information. Within the group, 
the different quantities are represented by adjacent bars. One set of bars is 
seperated from another by a suitable space between them. Imports and exports of a 
country may be shown side by side for d'fferent months. Birth and death rates of a 
country over a number of years may be shown side by side for different years. 
Example 2.4, 

Draw suitable diagram for the data of birth rates and death rates of following 
countries. 


Country | Birth rate | Death rate 


Pakistan 
India 


Britain 
Egypt 
Japan 
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The given,data are suitable for multiple bar diagrams. Thus the multiple bar 
diagram is drawn below: 
Multiple bar diagram showing birth and death rates in some countries. 
Rates Birth rately 


Fig.2.2. 
aul Death rat 


Pakistan India Britain Egypt Japan 
2.6.4, SUB-DIVIDED BARS OR COMPONENT BAR CHART 

Component bar charts are made when we are interested to compare the totals 
of some groups of data and the comparison within the group is also desired. Each bar 
represents the total of some components and each component is shown as a 
proportional section of the bar. In this way the quantities within the group and 
between the groups are compared. The bars are seperated by suitable space between 
them, 
Example 2.5. 

Represent the following data about the number of sheep and goats (in 
hundreds) in various divisions of Punjab during the year 1982 — 83 by component 
bar chart. 


No. of sheep (00) No. of goats (00) 

Bahawalpur 1583 2250 
D.G. Khan 7168 1181" 
Faisalabad 1311 3122 4433 
Gujranwla 1495 2265 3760 
Lahore 4262 | 5777 10039 
Multan 1434 3778 6212 
Rawalpindi 372 | 3415 3787 
Sargodha__| 448 1048 feeviage 


Source: Punjab Development Statistics, 1984 , 
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Component bar chart 


Number 


Fig.2.3. 


10000 
' 


5000 
4000 


3000 


2000 | | 
1000 - 


0 


Sheep 


Goats 


D.G. Khan 


Bahawalpur 
Gujranwala 


Sargodha 


2.6.5. RECTANGLES 


The rectangle is four sided geometrical 
shape with some length and height. The areca 
of the rectangle is given by the product length 
x height. Rectangles are drawn for those 
quantities which differ a lot from one 
another. Rectangles are area diagrams 
Rectangle has two dimensions, it is théreforc 
possible to keep one of the dimensions 
constant for all rectangles and the other in 
proportion to the area of the rectangle. 
Suppose a factory has 1000 workers out of 
which 600 are skilled and 400 are unskilled. 


Faisalabad 


Multan 
Lahore 


We nave to show the skilled aud unskilled workers in rectaing!-s. Te areas of the 
two rectangles should be in the proportion G00: 400 or 3: 2. The two rectangles with 
equal base (breadth or width is constant) and the height in proportion to 3 : 2 are 
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shown in Fig.2.4. These two rectangles look like thick bars. Thus the reclangles with 
equal width are nothing but simple thick bars. We can make rectangles with the 
same height but length (base) proportional to the ratio 3: 2, The rectangles with the 
same height are shown here in Fig.2.5. Their lengths (base) are in the ratio 3: 2, We 
can also make rectangles in which both the lengths and heights are different but 
this type of diagrams are not popular. Rectangles are not very popular liagrams bul 
sub-divided rectangles are very popular 
Fig.2.5. 
Skilled workers Unskilled workers 


2.6.6. SUB-DIVIDED RECTANGLES 

The sub-divided rectangles are drawn to represent two o1 more groups of data 
when each group consists of some components and these components are also to be 
compared. The sub-divided rectangles are usually drawn to compare the budgets of 
various families. Each component of a group is converted into the percentage of the 
corresponding total. We draw one rectangle for each total taking equal length (100 
units) and breadth is taken proportional to the totals. Each rectangle is divided into 


different parts which represent different items of expenditure in family fiudgets. 


Example 2.6. 

Compare the budgets of family A and B with a suitable diagram. 
Items of expenditure Family A | Family B 
Food 2500 4000 
Clothing | 800 1000 
House rent 1200 2000 
Educati n \ 800 j 600 
Miscellaneous 700 | 1400 
Total 6000 | 9000 


ee 


Total expenditures are 6000 and 9000 which have the ratio 1: 1.5, The height 
of the two rectangles is the same and their breadth is in the ratio 1: 1.5 
Expenditure on each item is converted int. the percentage of its total. The 


calculations are made here and the sub-divided rectangles are shown in Fig.2.6 


Actual 
expenditure 


Item of 
expenditure 


Percentage 
expenditure 


Food 2500 * 100 = 41.7% 
Clothing 800 an * 100 = 13.3% 
House rent 1200 = * 100 = 20% 

Education 800 an * 100 = 13.3% 


700 
6000 * 100 = 11.7% 


Miscellaneous 


Percentage Fig.2.6. 
Family Family 
A B 
1005 
805 
60 a! 


204 


0 
Rs. 6000 


2.6.7. PIE DIAGRAM 


Rs. 9000 
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4000 * 100 = 44.4% 
aa * 100 =11.1% 
2060 * 100 = 22.2% 
oe x 100=6.7% 
tee, 100 = 15.6% 


9000 


Miscellancous 


Education 


House rent 


Pie diagram is a circular diagram where the whole circle represents a ‘total’ 
and the components of the total are represented by sectors of the pie diagram. Pie 
diagram is also called sector diagram. It is a popular diagram and is drawn when 
the components are to be shown for comparison, The total angle of the circle is 360° 
and the total quantity to be represented is taken equal to 360°. The angles for each 
component are calculated and these angles are made in the circle to show different 


components. 
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Example 2.7. 
The data on Agricultural Product at current factor cost for Pakistan for the 
year 1983 — 84 is given below. Make a pie diagram to represent the data. 


Major crops 46231 

Minor crops 
Livestock 
Fishing 
Forest 


Source: Punjab Development Statistics, 1984. 
The necessary calculations to make the Pie diagram are shown below and the 
diagram is shown in Fig.2.7. 


Sub-sectors Agriculture Angles of a sub-sectors 
Product ( million Rs.) 
4 
Major crops 46231 aes 360 = 181.2 
1497 
Minor crops 14971 Aue 360 = 58.7 
Livestock 27096 27086 . 360 = 106.2 
salt 30. 
Fishing 3082 rors 860 = 12.1 
457 
Forestry 457 91837 x 360 = 18 
Total 91837 360 


Pie Diagram showing product of agricultural sub-sectors for 1983 — 84 
F i 


Major crops 


Forestry 


Minor crops Fishing 


Livestock > 
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2.6.8. GRAPH OF TIME SERIES 

A graph of time series or historical series is called historigram. The time is 
taken on X-axis and the variable is taken on Y-axis. The plotted points are joined 
together to get the graph called historigram. 
Example 2.8. 

The number of slaughter houses in Punjab for the period 1975 — 76 to 
1982 — 83 are given below. Represent the data with a historigram. 


i eee ere 

Years : Years 

houses houses 

[|_i978=79 [1569 [982-28 

Source: Punjab Development Statistics, 1984. 

The given data is shown in a Historigram in Fig.2.8. 

Historigram: Showing number of slaughter houses in Punjab from 
1975 — 76 to 1982-83 


Fig.2.8. 


2400, 


Number of slaughter houses 
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Years 


2.7, GRAPHS OF FREQUENCY DISTRIBUTIONS 

The statistical data are mostly presented in the form of graphs. Some of the 
important praphs of frequency distiib. ons are 

G) Histogram (ii) Frequency polygon 

(iii) Frequency curve lv) Cumulative frequency polygon. 
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2.7.1. HISTOGRAM 

Histogram is a graph of the frequency distribution in which classes with class 
boundaries are taken on X-axis with a suitable breadth of class and adjacent bars 
are erected to show the frequencies. The height of the bars is in proportion to the 
size of the frequency. For uniform intervals, we take a suitable breadth for classes. 
For unequal intervals we have to adjust the frequency. If the interval becomes 
double, then frequency is divided by 2 so that the area of the bar is in proportion to 
the areas of other bars. Histogram is a very simple and very important graph of the 
frequency distribution. This graph makes the base for other graphs. If we take the 
frequencies on Y-axis, we get frequency histogram, the total area of which is equal to 
the total frequency. If we take relative frequencies on the Y-axis, the total area of 
the histogram is unity. If we take the percentage frequencies on Y-axis, we get 
percentage frequency histogram, the total area of the histogram will be 100, 


Example 2.9. 
Draw histogram of the following frequency distribution. 
Classes ' 65-57 58 - 60 61-63 64 — 66 67-69 
Frequency | 4 : 6 4 | 12 4 


We write the class boundaries in the following table and the Histogram is shown in 
Fig.2.9. 


Groups F Frequency | Class boundaries 
55-57 4 54.5 — 57.5 
58-60 6 57.5 — 60.5 
61-63 | 14 60.5 — 63.5 
64-66 12 | 63,5-66.5 
67-69 | 4 66.5 — 69.5 
Total } j AO 3s 
164 Fig.2.9. Histogram 


54.5 — 57.5- 60.5 - 63.5 665— 69.5 
Class Boundaries 
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2.7.2. FREQUENCY POLYGON 

Frequency polygon is a graph of the frequency distribution in which the 
frequencies are plotted against the midpoints of the classes. The plotted points are 
joined together to get the frequency polygon. 
Example 2.10. i 

Make frequency polygon of the data. 
[ Frequency | 4 | 8 10 20 15 8 2 


6 


The midpoints of the classes are taken on X-axis and the frequency is plotted 
on Y-axis. 
The graph is given below in Fig.2.10. 


Fig.2.10. Frequency Polygon 


——> Frequency Curve 


125 17.5 22.5 27.5 32.5 37.5 42.5 
Midpoints 


75 


2.7.3. FREQUENCY CURVE 

In frequency curve the points are not joined together by straight lines. The 
free-hand drawing method of drawing curve is used and we get the frequency curve 
as shown in fig.2.10. We can draw the frequency curve on the frequency polygon or 
we can draw the curve on the seperate sheet of paper. 


2.7.4, CUMULATIVE FREQUENCY POLYGON 
In cumulative frequency polygon, the cumulative frequencies are plotted 
" against the upper class boundaries, This graph can be used to interpolate the values 


of median, quartiles and other partition values. The word Ogive polygon is also used 
for cumulative frequency polygon. 
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Example 2.11. 


Make cumulative frequency polygon of the given data. 


|Groups 


2 | 


10-14 | 15-19 | 20-24 | 25-29 | 


Frequency 


5-9 
10-14 
15-19 
20 - 24 
25-29 


10 


The necessary calculations are given below: 


T 
15 


4,6 =9.5 ‘10 
9.5 ~14.5 25 
14.5 - 19.5 45 
19.6 ~ 24.5 15 
24.5 - 29.5 | 93 


. 


Cumulative frequency 
a 
i) 


Cumulative Frequency polygon. 
Fig.2.11. 


4.5 


9.5 14.5 19.5 24.5 29.5 


Upper class boundaries 
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SHORT DEFINITIONS 
Classification 


The process of grouping a large number of individual facts or observations on the 
basis of similarity among the items is called classification. 

or 
The process of arranging data in groups or classes according to-resemblances and 
similarities is called classification 


Simple Classification 


When data are classified according to a single characteristic, it is called simple 
classification or one-way classification. 


Raw Data 
A list of the data in the form in which these are collected, is called raw data. 
Array 
An array is a listing of the data in order of numerical magnitude. 
or 


A list of the observations on a variable, either in ascending order or descending 
order. 


Frequency Distribution 

A frequency distribution is a table showing the number of items in each class. 
or a 

A frequency distribution is a listing of classes and their frequencies. 

Disercte Distribution 


A distribution formed by a set of values which are of discrete nature is called a 
discrete distribution. 


Continuous Distribution 

A continuous distribution can assume an infinite number of values within a specific 
range. 

Grouped Data 


Data available in class intervals and summarized in form of a frequency 
distribution. 


Class Limits 
The smallest and largest values in any given class are called its class limits. 
Class Boundaries 


The class boundaries are obtained by increasing the upper class limits and 
decreasing the lower class limits by the same amount so that there are no gaps 
between consecutive classes. 


WwW 


[Chapter 2] Presentation of Data 65 
Class Width Les cll, 


The class width is the difference between two consecutive eg class. limits or 
between}\the upper and os class boundaries of any class. ( (te 


Mass shady ot igo ie aca tie Ufet ang Lowe c hos Lous lan 
frets point in each Stes that is is Ifway between the lower and upper class limits, is 
called midpoint of that class. cule, Mog 


Relative Frequency 


The relative frequency for a particular class is equal to the class frequency divided 
by the total number of observations. 


*« Percentage Frequency 


Percentage frequency is computed by dividing the frequency in a category by the 
total number of observations and multiplying by 100. 

Cumulative Frequency 

The cumulative frequency is the number of observations less than or equal to a given 
value of the variable. 


or 


r+ panne frequency is the sum of the frequencies for several consecutive classes 
f 


of a frequency distribution. 
Bivariate Frequency Distribution or Joint Frequency Distribution 


A summary of a bivariate set of data that displays the number of observations of 
respective joint characteristics of one value taken from each of the variables that 
define the data set, is called a bivariate frequency distribution. 


Tabulation 
Tabulation is an orderly arrangement of data in columns and rows. 
Pie Chart 


Pie chart is a graphical device for presenting qualitative data summaries based on 
subdivision of a circle into sectors that correspond to the relative frequency for each 


class. eon x dML- 
Vor 
A graph in the shape of a circle that is divided into “slices” corresponding to the 


categories or classes to be displayed. The size of the slices is proportional to the 
magnitude of the displayed variable associated with each category or class is called a 
pie chart. 
Histogram 
A histogram is a vertical bar chart in which the rectangular bars are constructed at 
the boundaries of each class. 

or & 


A graphic presentation of data by adjacent rectangles in which each rectangle 
having area proportional to class frequency is called histogram. 
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Frequency Polygon 


A graphic device for illustrating the frequency distribution in which the frequencies 
dre plotted against class marks and successive points are connected with the help of 
straight lines. 


Frequeacy Curve 


A form of graph, representing a frequency distribution, in which a continuous line is 
used to indicate the general trend of frequencies. 


Cumulative Frequency Polygon or Ogive 


The graphical display for a cumulative frequency distribution with upper class 
boundaries is called a cumulative frequency polygon or ogive. 


Cumulative Frequency Curve 


Acurve that shows the number of cases below the upper true limit of an interval, is 
called a cumulative frequency curve. 


Historigram 
A graph of time series or historical series is called historigram. 
Skewed Distribution 


A distribution that departs from symmetry and tails off at one end is called a skewed 
distribution. 


or 


A distribution in which the observations are concentrated at one end of the 
distribution is called a skewed distribution. 


Positively Skewed Distribution 


A distribution that has relatively fewer frequencies at the higher end of the 
horizontc! exis, is termed as positively skewed distribution. 


Negatively Skewed Distribution 


A distribution that has relatively fewer frequencies at the lower end of the 
horizontal a ds. 


Normal Curve 
A normal curve is a bell shaped frequency curve. 


MULTIPLE - CHOICE QUESTIONS 


1. When data are classified according to a single characteristic, it is called: 


(a) quantitative classification (b) qualitative classification 
(c) area classification (d) simple classification 

2. Classification of data by attributes is called: 
(a) quantitative classification (b) chronological classification 


(c) qualitative classification (d) geographical classification 
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3. 


10. 


11. 


12, 


Classification of data according to locations or areas is called: 


(a) qualitative classification (b) quantitative classification 
(c) geographical classification (d) chronological classification 
Classification is applicable in case of: 

(a) normal characters (b) quantitative characters 

(c) qualitative characters (d) both (b) and (c) 

In classification, the data are arranged according to: 

(a) similarities (b) differences 

(c) percentages (d) ratios 

When data are arranged at regular intervals of time, the classification is called: 
(a) qualitative (b) quantitative 

(c) chronological (a) geographical 

When an attribute has more than three levels it is called: 

(a) manifold-division (b) dichotomy 

(c) one-way (d) bivariate 


The series 


Pakistan Britain 


Country 


Birth rate 45 40 Ae: | ee eee 
is of the type: 
(a) discrete (b) continuous 
(c) individual (d) time series 
The series x : 


is of the type: 


(a) inclusive (b) exclusive 

(c) geographical (d) time series : 

In an array, the data are: 

(a) in ascending order (b) in descending order 
(c) either (a) or (b) (d) neither (a) or (b) 

The number of tally sheet count for each value or a group is called: 
(a) class limit (b) class width 

(c) class boundary (d) frequency 


The frequency distribution according to individual variate values is called: 
(a) discrete frequency distribution (b) cumulative frequency distribution 
(c) percentage frequency distribution (d) continuous frequency distribution 
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17. 


18. 


19. 


20. 


21. 


22. 


23, 
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A series arranged according to each and every item is known as: 


(a) discrete series (b) continuous series 
(c) individual series (d) time series 

A frequency distribution can be: 

(a) qualitative (b) discrete 

(c) continuous (d) both (b) and (c) 


The following frequency distribution 


es ee ee 
ee ae ee 
is classified as; 


(a) relative frequency distribution (b) continuous distribution 
(c) percentage frequency distribution (d) discrete distribution 


Frequency distribution is often constructed with the help of: 
(a) entry table (b) tally sheet 

(c) both (a) and (b) (d) neither (a) and (b) 
The data given as, 3, 5, 15, 35, 70, 84, 96 will be called as: 

(a) individual series (b) discrete series 

(c) continuous series (d) time series 
Frequency of a variable is always in: 

(a) fraction form (b) percentage form 
(c) less than form (d) integer form 

Data arranged in ascending or descending order of magnitude is called: 
(a) ungrouped data (b) grouped data 


(c) discrete frequency distribution (d) arrayed data 

The grouped data are called: 

(a) primary data (b) secondary data 

(c) raw data (d) difficult to tell 

A series of data with exclusive classes along with the corresponding frequencies 
is called: 

(a) discrete frequency distribution (b) continuous frequency distribution 
(c) percentage frequency distribution (d) cumulative frequency distribution 
In an exclusive classification, the limits excluded are: 


(a) upper limits (b) lower limits 
(c) both lower and upper limits (d) either lower or upper limit 
The series 


[Weghes (pounds) 16-20 | 20-05 [25-00 | 00-08 | 5-00 | 
[Noorieme | v0 | = | w | w | 3 | 
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26. 


26. 


27. 


28. 
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380. 


31, 


82. 


is categorized as: 


(a) continuous series (b) discrete series 
(c) time series (d) geometric series 
The series 


[ie | | 


[a 


will be called as: 

(a) time series (b) discrete series 
(c) continuous series (d) individual series 
The suitable formula for computing the number of classes is: 
(a) 3.822 log N (b) 0.822 log N 

() 1+3,322logN @) 1-3,322 log N 


The number of classes in a frequency distribution is obtained by dividing the 
range of variable by the: 


(a) total frequency (b) class interval 

(c) mid-points (d) relative frequency 

If the number of workers in a factory is 256, the number of classes will be: 
(a) 8 (b) 9 () 10 (@ 12 


The largest and the smallest values of any given class of a frequency 
distribution are called: 


(a) class Intervals (b) class marks 

(c) class boundaries (a) class limits 

If there are no gaps between consecutive classes, the limits are called: 
(a) class limits (b) class boundaries 

(c) class intervals (d) class marks 


The extreme values used to describe the different classes in a frequency 
distribution are called: 


(a) class intervals (b) class boundaries 
(c) class limits (d) cumulative frequency 


If in a frequency table, either the lower limit of first class or the upper limit of 
last class is not a fixed number, then classes are called: 


(a) one-way classes (b) two-way classes 

(c) discrete classes (d) open-end classes 

The class boundaries can be taken when the nature of variable is: 
(a) discrete (b) continuous 


(c) both (a) and (b) (d) qualitative 
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40. 


41. 
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Class boundaries are also called: 


(a) mathematical limits (b) arithmetic limits 

(c) geometric limits (d) qualitative limits 

The average of lower and upper class limits is called: 

(a) class boundary (b) class frequency 

(c) class mark (d) class limit 

The lower and upper class limits are 20 and 30, the midpoints of the class is: 
(a) 20 (b) 25 (c) 30 (ad) 50 


A frequency distribution that contains a class with limits of “10 and under 20” 
would have a midpoint: 

(a) 10 (b) 14.9 

(c) 15 (a) 20 

If the number of workers in a factory is 128 and maximum and minimum 
hourly wages are 100 and 20 respectively. For the frequency distribution of 
hourly wages, the class interval is: 

(a) 8 (b) 9 fey 10 (d) 80 

Width of interval h is equal to: 


_, Jargest number — 20 largest. number + smallest number 
(@) “umber of classes ) number of classes 
© largest number — smallest number (a) number of classes 

c Siiies cae. 


number of classes range 
Length of interval is calculated as: 

(a) the difference between upper limit and lower limit 

(b) the sum of upper limit and lower limit 

(c) half of the difference between upper limit and lower limit 

(d) half of the sum of upper limit and lower limit 

The class marks are given below: 

10, 12, 14, 16, 18. The first class of¢he distribution is: 

(a) 9-12 (b) 10.5- 12.5 

@ 9-11 (d) 10-12 

If the midpoints are 10, 15, 20, 25 and 30. The last class boundary of the 
distribution is: 


(a) 25-30 (b)  27.5- 32.5 
(c) 20-35 (d) 30-35 
The number of classes depends upon: 

(a) class marks (b) frequency 


(c) class interval (d) class boundary 
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48. The class interval is the difference between: 


(a) two extreme values (b) two successive frequencies 
(c) two successive upper limits (d) two largest values 
44. When the classes are 40 - 44, 45 - 49, 50 - 54, ---, the class interval is: 
40 +44 
(a) 4 aan 
(c) 100 (d) 6 


45. A grouping of data into mutually exclusive classes showing the number of 
observations in cach class is called: 
(a) frequency polygon (b) relative frequency 
(c) frequency distribution (d) cumulative frequency 
The following frequency distribution 


Toes thand | Loos thand |, Lage thadi0 
Firegiey eee ici nate bl 


is classified as: 
(a) inclusive classification (b) exclusive classification 
(c) discrete classification (d) cross classification 


47. The following frequency distribution 
Classes 10-20 20-30 30 - 40 40-50 50-60 


Frequency 2 6 4 


is classified as: 

(a) exclusive classification (b) inclusive classification 

(c) geographical classification (d) two-way classification 
48. The following frequency distribution 


is classified as: 

(a) multiple classification (b) qualitative classification 
(c) inclusive classification (d) exclusive classification 
The following frequency distribution 


More thand | More than6 | More than8 | More thanl0 | More than12 


is classified as: 
(a) geographical classification (b) chronological classification 
(c) inclusive classification (d) exclusive classification 
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50. The class frequency divided by the total number of observations is called: 


(a) percentage frequency (b) relative frequency 
(c) cumulative frequency (d) bivariate frequency 
51. The relative frequency multiplied by 100 is called: 
(a) percentage frequency (b) cumulative frequency 
(c) bivariate frequency (d) simple frequency 
52. Ina relative frequency distribution, the total of the relative frequencies is: 
(a) 100 (b) one 
(c) if (d) =X 


53. In a percentage frequency distribution, the total of the percentage frequencies 
is always equal to: 


@ 1 (b) if 
() 100% (d) =X 
54, The cumulative frequency of first group in more than cumulative frequency 
distribution is always equal to: 
(a) 1 (b) 100 
(ce) if (d) =X 


55. The cumulative frequency of last class in less than cumulative frequency 
distribution is always equal to: 
(a) =f (b) =fX 
@ 1 (a) 100 

56. The following frequency distribution 


Less than 10 | Less than 20 | Less than’30 | Less than 40 
Le ee a ee ae, | 


is classified as: 

(a) less than cumulative frequency distribution 

(b) more than cumulative frequency distribution 

(c) discrete frequency distribution 

(d) cumulative percentage frequency distribution 
57. The following frequency distribution 


Frequency 40 


is classified as: 
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(a) relative frequency distribution 

(b) less than cumulative frequency distribution 

(c) more than cumulative frequency distribution 

(d) bivariate frequency distribution 

A frequency distribution formed considering two variables at a time is called: 
(a) univariate frequency distribution 

(b) bivariate frequency distribution 

(c)  trivariate frequency distribution 

(d) bimodal distribution 

The sum of rows or sum of columns of a bivariate frequency distribution is 
equal to: 


(a) =X (b) ZfX 

(©) X(f£+X) (d) Xf 

The arrangement of data in rows and columns is called: 

(a) classification (b) tabulation 

(c) frequency distribution (da) cumulative frequency distribution 


When the qualitative or quantitative raw data are classified according to one 
characteristic, the tabulation of different groups is called: 


(a) dichotomy (b)  manifuld-division 
(c) bivariate (a) one-way 
A statistical table consists of at least: 
(a) two parts (b) three parts 
(c) four parts (a) five parts 
In a statistical table, prefatory note is shown: 
(a) below the body (b) box head 
(c) foot note (dj) betow the title 
A source note in a statistical table is given: 
(a) at the end of a table (b) in the beginning of a table 
(c) in the middle of a table (d) below the body of a table 
In a statistical table, column captions are called: 
(a) box head (b) stub 
“(c) body (d) title 
In a statistical table, row captions are called: 
(a) box head (b) stub 
(c) body (d)_ title 
The headings of the rows of a table are called: 
(a) prefatory notes (b) titles 
(c) stubs (d) captions 
The headings of the columns of a table are called: 
(a) stubs (b) captions 


(c) footnotes (d) source notes 
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69. The budgets of two families can be compared by: 


(a) sub-divided rectangles (b) pie diagram 
(c) both (a) and (b) (d) histogram 
70. Total angle of the pie-chart is: 
(a) 45 (b) 90 (c) 180 (d) 360 
71. Diagram are another form of: 
(a) ~classification (b) tabulation 
(c) angle (d) percentage 
72. In pie diagram, the angle of a sub-sector is obtained as: 
(a) eoeps part x 90° (b) eee part x 180° 
© Se part x 120° @ aE part x 360° 
78. A pie diagram is represented by a: 
(a) rectangle (b) circle 
(c) triangle (d) square 
74. A sector diagram is also called: 
(a) bar diagram (b) histogram 
(c) historigram (d) pie diagram 
75. Which of the followings is not a one-dimensional diagram? 
(a) simple bar diagram (b) multiple bar diagram 
(c) component bar diagram (d) pie diagram 
76. Which of the followings is a two-dimensional diagram? 
(a) sub-divided bar (b) percentage component bar chart 
(c) sub-divided rectangles (d) multiple bar diagram 
77. Pie diagram represents the components of a factor by: 
(a) circles (b) sectors 
(c) angles (d) percentages 


78. The suitable diagram to represent the data relating to the monthly expenditure 
on different items by a family is: 


(a) histerigram (b) histogram 
(c) multiple bar diagram (d) pie diagram 
79. A graph of time series or historical series is called: 
(a) histogram Xi (b) historigram 
(¢) frequency curve \ @ __ frequency polygon 
80. The historigram is the graphical presentation of data which are classified: 
(a) geographically (b) numerically 


(c) qualitatively (d) according to time 
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Historigram and histogram are: 

(a) always same (b) not same 

(c) off and on same (d) randomly same 

A distribution in which the observations are concentrated at one end of the 
distribution is called a: 


(a) symmetric distribution (b) normal distribution 

(c) skewed distribution (d) uniform distribution 

For graphic presentation of a frequency distribution, the paper to be used is: 
(a) carbon paper (b) ordinary paper 

(c) graph paper (d) butter paper 


Histogram can be drawn only for: 

(a) discrete frequency distribution (b) continuous frequency distribution 
(c) cumulative frequency distribution (d) relative frequency distribution 
Histogram is a graph of: 


(a) frequency distribution (b) time series 

(c) qualitative data (d)  ogive 

Histogram and frequency polygon are two graphical representations of: 
(a) frequency distribution (b) class boundaries 

(c) class intervals (d) class marks 

Frequency polygon can be drawn with the help of: 

(a) historigram (b) histogram 

(c) circle (d) percentage 


In a cumulative frequency polygon, the cumulative frequency of each class is 
plotted against: 


(a) mid-point (b) lower class boundary 
(ce) upper class boundary (d) upper class limit 
The graph of the cumulative frequency distribution is called: 

(a) histogram (b) frequency polygon 
(c) pictogram (d)  ogive 


When successive mid-points in a histogram are connected by straight lines, the 
graph is called a: 


(a) historigram (b) ogive 

(c) frequency curve (d) frequency polygon 
A frequency polygon is a closed figure which is: 

(a) one sided (b) two sided 

(c) three sided (d) many sided 

Ogive curve can be occurred for the distribution of: 

(a) less than type (b) more than type 


(c) both (a) and (b) (d) neither (a) and (b) 
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The word ogive is also used for: 
(a) frequency polygon (b) cumulative frequency polygon 
(c) frequency curve (d) histogram 
Cumulative frequency polygon can be used for the calculation of: 
(a) mean (b) median 
(c) mode (d) geometric mean 


Answers 


@|6 ©|7% @]s © 
(c) 


(a) | 66. (b) | 67. ©) 


(©) 64. (a) 


69.) | 70. @ | 71. @) | 72. @ 
(b) | 74.) | 75. @ | 76. (© | 77. () | 78. @ | 79. (| 80. (a) 
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. (d) | 90. (d) | 91. (d) | 92. © | 93. ) | 94. (b) 
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SHORT QUESTIONS 


Qi 
Q.2 
Q.3 
Q.4 
Q.5 
Q.6 
Q.7 
Q.8 
Qs 
Q.10 


Q.11 


Q.12 
Q.13 
Q.14 
Q.15 


What is classification? 

What are the main objectives of classification? 

Explain the simple classification. 

Write four important bases of classification of data. 

Write down the different kinds of classification. 

Mention the different characteristics of classification. 

Describe the modes of classification. 

Differentiate between inclusive classification and exclusive classification. 
Distinguish between one-way and two-way classification. 

How do you find the class interval in the construction of a frequency 
distribution? 

Write down the formulas for the number of classes and class interval for a 
frequency distribution. 

Define the terms class limits and class boundaries. 

What are the open-end classes? 

What is meant by a frequency distribution? 

Write down the main steps i olved in the construction of a frequency 
distribution from individual obs 
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Q.16 
Q.17 


Q.18 
Q.19 
Q.20 
Q.21 
Q.22 
Q.23 
Q.24 


Q.25 
Q.26 
Q.27 
Q.28 
Q.29 
Q.30 
Q.31 
Q.32 
Q.33 
Q.34 
Q.35 
Q.36 
Q.37 
Q.38 


Q.39 
Q.40 
Q.41 
Q.42 
Q.43 
Q.44 
Q.45 
Q.46 
Q.47 
Q.48 
Q.49 
Q.50 
Q.51 
Q.52 
Q.53 


Explain the construction of a frequency distribution. 

Write down a sequence of steps that can be followed to assist in preparing a 
frequency distribution that provides a good representation of the raw scores. 
Differentiate between grouped data and ungrouped data. 

Define relative frequency distribution. 

What is meant by cumulative frequency distribution? 

Write a short note on cumulative frequency distribution. 

Explain the percentage frequency distribution. 

Define a bivariate frequency distribution. 

What is the difference between univariate frequency distributions and 
bivariate frequency distributions? 

Write a short note on cross table or contingency table. 

Define the term tabulation. 

What is meant by tabulation of statistical data? 

Write down the general rules of tabulation. 

Write down the important points to prepare a good table. 

Describe the main parts of the table. 

Differentiate between box-head and stub. 

Define the terms pre-factory note and footnote. 

Name the different types of diagrams used in representing data. 

What are the advantages of diagrams? 

Write down the name of diagrams which are one-dimensional. 

Write down the formula of the angle of a sub-sector used in pie chart. 
Define component bar chart. 

Write down the main steps in the construction of pie diagram or sector 
diagram. 

Explain the multiple bar diagrams. 

Differentiate between a component bar chart and a pie chart. 

Define the skewness. i 

What is histogram? 

Define frequency polygon. 

Write short note on cumulative frequency polygon. 

Write down the important graphs of frequency distributions. 

What are the merits of graphs? 

Distinguish between histogram and historigram. 

Write any six rules for drawing graphs. 

Define the term ogive. 

Explain the symmetry. 

Name the different types of diagrams which are two-dimensional 
Distinguish between diagrammatic and graphic representation, 
Differentiate between diagrams and graphs. Write names of graphs of 
frequency distributions. 
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EXERCISES 


Q.1 Array the following raw data from a sample of n = 10 midterm examination 
scores in statistics: 
63, 45, 95, 83, 90, 78, 60, 88, 93, 54 
Q.2 The following values relate to the number of members in various families. 
Make a frequency distribution, taking one as the size of interval. 


[s{9foT7][e6[s[4f2[3fale[s{eliulilelo[s [7] 
reholstrtetetstetstttstetsts tats rts Tete 
L5]6[3fs]7{7fiofife {i | 5 


[s]3[s5[e]7[6fiofe[s} 


Q.3 The following are the number of flowers on different branches of a tree. Make 
a discrete frequency distribution, taking a class interval of size one. 


[s]2[2[a[2fif2fiofsfo’ [2 [s[ofa[sfiola[2fo[e[7[s[2 [a] 
is{sfiofsfo[sfifofs[sf2[s]a]el7]sfels[a]s[s[slafe]s| 


Q.4 The following data shows the number of heads in an experiment of 50 sets of 
tossing a biased coin 5 times. Make a discrete frequency distribution. 


ate eee ape eae Jofa[sf2{sfefil2ys| 
[24] shite tlstebat Hats latrtttstotetetera tie 


Q.5 Read the following passage and observe the word length of each word. Make 
a frequency distribution of word length. 
“Statistics is very important to the banks. The banks do business with the 
help of deposits in the banks. The banks do not keep all the deposits with 
them. They do business with it and lend the money to the borrowers. The 
banks work on the assumption that all depositors do not withdraw their 
deposits on the same day. The bankers use statistical approach based on 
probability to estimate the number of depositors and their claims for a 
certain day.” 

Q.6 — Tabulate the following marks into a frequency distribution taking 10 as class 

interval by inclusive method and 45 as the lowest limit. Make the frequency 

polygon for the frequency distribution so obtained. 


59 | 62 | 96 | 
m7 Horr or a Lae] 
Q.7 The following is a record of weights of 70 students (in lbs.). Tabulate the data in 
the form of Steg Haney. distribution es the classes like 60 — 64, 65 — 69,. 


[102 [92 | 9 
[72 aaa | a01 
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Q.8 The following data give the index numbers of 50 commodities in a certain 
year. Tabulate the data into a frequency distribution taking a class interval 
of size 5. 


Q.9 Form a frequency distribution of the following taking 6 as class interval by 
exclusive method. 


88.1 77.8 74.2 80.7 
65.7 


5. 58.3 69.4 83.2 82.7 
73.8 67.6 81.9 73.5 77.6 48.6 83.5 70.8 


Q.10 Make a frequency distribution taking the classes as 1.19 — 1.23, 1.24 — 1.28, 
etc., from the following data: 
| 1.76 


147 lide 168 u| 1.60 : 
126 | 159 | 164 [| 146 | 150 | 119 | 14s | 124 
161 | 1.38 | 141 | 1.50 1.56 
1.65 | 1.45 | 167 | 154 | 164 | 1395 | 149.| 145 


Also make the class boundaries. 
Q.11.G) The grades in Mathematics of 50 students are as under: 


1.76 


Form a frequency distribution taking intervals of 5 grades like 50 — 54, 
55 — 59, ete. 


(ii) Draw a histogram of the grouped data formed in part (i) on graph paper. 
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Q.12 The following table shows the diameters in millimetres of a sample of 60 ball 
bearings manufactured by a company. Convert to a frequency table with 6 
classes, beginning with 7.24 by using inclusive classification. 


Q.13 The following table gives the weights in pounds of 40 students at a college. 
“ Construct a frequency distribution of the weights using appropriate class 
interval. 
% 128, 150, 156, 145, 147, 165, 142, 140, 154, 135, 158, 118, 145, 146, 
163, 161, 157, 180, 135, 149, 138, 140, 125, 126, 153, 144, 168, 135, 
132, 144, 147, 150, 152, 142, 164, 148, 173, 138, 136, 146. 
Q.14The following data represent reported sales (in millions of rupees) for 26 
companies in the shoe industry: 
32, 36, 54, 38, 17, 41, 22, 33, 22, 32, 31, 21, 18, 
46, 36, 11, 31, 29, 12, 23, 51, 12, 13, 37, 33, 27. 
Construct a frequency distribution using classes with a width of 10 i.e; 10 - 20, 
20 — 30, etc. 


Q.15 The class marks for the ages of sales clerks employed in a department store are 
18.5, 28.5, 38.5, 48.5, 58.5 and 68.5. Find the class boundaries of this 
distribution. 


Q.16 The daily expenditure (rupees) in an office are grouped into a table having the 
classes 100 — 149, 150 - 199, 200 - 249, 250 — 299 and 300 - 349. 


Find (i) the class boundaries Gi) the class marks (iii) the class interval 


Q.17 Write down the class boundaries, class marks and class intervals for each of 
the following classes. 


(a) 4-9, 10-15 (b) 2.1-2.4, 2.5-2.8 
() -3-(+3), 4-10 (d) 8,12, 16 

Q.18 Given the frequencies of 5 classes 2, 6, 8, 4, 5. Find relative frequencies and 
cumulative frequencies. 


Q.19 The following are the shipping weights in pounds of 60 supposedly identical 


office desks, 
100-100.9 | 101-101.9 | 102-102.9 


; 
Number of desks imi an ee er 


Find: (i) the class boundaries (ii) the class marks (iii) the relative frequency 
distribution (iv) the cumulative percentage frequency distribution 
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Q.20 Consider the following data: 
90-109 | 110-129 


Hourly wages ( Rs.) | 30 — 49 


Determine: (i) Relative frequency distribution 
(ii) Percentage frequency distribution 
(iii) More than cumulative frequency distribution 
(iv) Less than cumulative frequency distribution 
(v) Cumulative percentage frequency distribution. 
Q.21The masses measured to the nearest kg. of 50 boys are noted and the 
distribution formed. 


Masses ( kg.) 
Number of boys 


[ene [a-w [on [woe |) 
6 12 14 10 6 


Construct relative frequency, percentage frequency, cumulative relative 
frequency and cumulative percentage frequency distributions. 


Q.22 Given the following frequency distribution, fill in the missing frequencies: 
Hourly wages (Rs.) | No. of workers (f ) | Cumulative frequency (c.f ) 
50 - 70 10 10 


90-110 
110 — 130 + 
130 - 150 


Q.23 The daily wages in rupees are given in the following table. Find the missing 
values. 


Daily wages ( Rs.) No. of workers (f ) Cumulative frequency (c.f ) 


500 - 519 = 4 
520 - 539 20 = 
3 


540 - 559 


2 
560 - 579 - 
580 - 599 10 


600 - 619 
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Q.24 The daily wages in rupees are given in the following table. Find the missing 
values. 


Relative Cumulative 


No. of Workers 
Frequency Frequency 


mC e Ea od oo 


Q.25 Following are the monthly salaries of a sample of 50 workers in an office. 
Determine the missing values. 


Monthly Salary No. of workers 
(in thousand Rs.) (f) 


| 16 and under 18 _| | 16 and under 18 _| under 18 


Cumulative 
Frequency 


Relative 
Frequency 


0.12 


18 and under 20 AL 


20 and under 22 


22 and under 24 


Q.26 The masses measured in kilogrammes of 100 students are noted and the 
distribution formed. Construct a frequency distribution of data. 


Make a frequency distribution. Draw a histogram. 


(Chapter 2] Presentation of Data 


83 


Q.28 Make a bivariate frequency distribution for the data given below, taking the 
classes of heights such as 60 — 62, 63 — 65 etc. and the classes of weights as 
100 — 104, 105 — 109 etc. ; 


Heights | Weights | Heights | Weights | Heights | Weights 
(inches) | (pounds) | (inches) | (pounds) | (inches) | (pounds) 


Q.29 The following table shows the heights ( cm.) and weights ( kg.) of 60 students. 
Make a bivariate frequency distribution, taking a class interval of 5 cm. for 
nestle and 3 kg. for weights. 


ss [i 
vere] [| z 


52 


150 
61 


58 


Weights (kg,) beta 54 
Heights (em. | 162 | 160 | 170 
Frenne [oo [6 [oe [2 | 
ie [1 | on [a 
Pron cis]  [ os Tox [08 


[weiner] co [66 [ca [oa [oa [oo [os [on 
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Q.30 The areas of the various continents/country of the world in millions of square 
kilometers are given as: 


Continent / country Area ( million of square kilometers ) 


Africa 30.3 
Asia 

Europe 

North America 
Oceania 

South America 
US.S.R. 


Graph the data by (i) A simple bar chart. (ii) A pie chart. 


Q.31 The following table gives the details of monthly budgets of two families. 
Represent these figures by a suitable diagram. 


Rs.6000 
Rs.1000 
Rs.4000 


Food. 
Clothing 

House rent 
Fuel and lighting Rs.1000 
Miscellaneous Rs.3000 


Q.82 Draw a component bar chart to represent the following data: 


Expenditure in thousands of rupees 


1999 
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Q.33 The following table gives the details of monthly expenditure ( in thousand 
rupees ) of three families. Draw a sub divided rectangular diagram: 


Food 
Clothing 
House rent 
Education 
Recreation 
Conventional needs 
Miscellaneous 


Food 

Clothing 
House rent 
Education 
Fuel and light 
Miscellaneous 


Q.35 Given below is the area under wheat in Rawalpindi Division by districts in 


1989 — 90: 
Districts Attock Chakwal Jhelum Rawalpindi 
Area ( Hectares ) 162 135 56 135 


Represent the data by a pie chart. 
Source: Directorate of Agriculture Crop Reporting Service, Punjab, Lahore. 


Q.36 The following table gives the details of monthly expenditure (in thousand 
rupees) of three families. Construct a pie diagram to compare the budgets of 
three families A, B and C. 


10 
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Q.37 The following are daily numbers of automobiles rented by an automobile 
rental company in 90 business days. 


[| asan [anoa] onze [aa [a [eos 
Pep fatafule| «| 


Draw histogram and frequency polygon of this distribution. 
Q.38 Draw histogram and frequency polygon on the graph paper for the following 


frequency distribution. 
| Hourly wages (Rs. | 50-54 | 56.59 | 60-64 | 65.69 | 70-74 | 75-79 | 80-84 | 
| Number ofworkere | “4 [8 | iz [ 20 | 16 | 
Q.39 The following table gives the weights in pounds of 40 students at a college. 
Draw a cumulative frequency polygon. 


Weights | 113-126 | 127-135 153 | 154 - 162 
( pounds) 

students 

Q.40 The daily wages of 150 workers are given below in the form of grouped data. 


Draw frequency polygon, frequency curve and ogive polygon. 
Daily wages ( Rs.) | No. of workers (f} | Daily wages (Rs.) | No. of workers (f) 


[“as0-s00 | as) aso 00 | 
[~ s00=s50 | 0) two toof 
ETN Fe hang pane 


Automobile rentals 


Number of days. 


136 - 144 


Chapter 
3 
MEASURES OF CENTRAL TENDENCY 


3.1. INTRODUCTION 

In our daily life, it is quite common to hear the following statements. 

(i) The price of apples in the market is Rs.35 per kg. 

(ii) Mr. A studies for 8 hours daily. 

(iii) Death rate in Pakistan is 15 per thousand. 

(iv) Consumption of meat in a family is 5 kg. per month. 

(vy) The rain fall in a country is 25 cms. per year. 

(vi) Mr. B plays cricket daily for 2 hours. 

If we concentrate on the above statements, we find that none of them is the 
exact statement. The prices of apples are generally different’on different shops. On 
some particular day the prices may change from morning to evening. Statement 
(ii) is about the study hours of Mr. A. May be that Mr. A has never studied for 
‘exactly 8 hours. His study hours are approximately 8 hours. These statements are 
not exactly true but still they are very important. We can say that these are 
approximate statements about different situations. In statistics we say that these 
are the average statements. In our daily conversation, we make many statements 
which have some meaning only on average basis. When we talk about some 
numerical data or some situation is described in a numerical manner, the word 
average plays very important role. In different fields of life like agriculture, 
education, industry, prices etc., the idea of average is very important. There are very 
large number of experts at national and internat!ynal level who are involved in the 
study of averages of various kinds. Some experts are interested to find the average 
production of wheat per acre in different countries, some others may be interested in 
the average prices of wheat in those countries. Thus the word average is deeply 
linked with the situations which need some numerical study or analysis. The 
average is also called measure of central tendency. 

3.2. CALCULATION OF AVERAGE 

Average is a single value which is calculated to represent the whole data. It 
may be calculated for a sample or a population data. The average is a value which 
expresses the central idea of the observations. It is a single value used to represent 
the data. It is a value somewhere in the centre, where most of the items of the series 
cluster. Such values are also called measures of central tendency. Different averages 
are used in different situations to represent the data. A choice of the proper average 
is the job of an expert who is calculating the average. 


87 
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\_-8:8. QUALITIES OF A GOOD AVERAGE 
Following qualities are associated with a good average. We try to use that 
average which possesses all or most of these qualities. 
(i) It should be rigidly defined. lt means that there-should be no confusion about 
its calculation. 
(ii) It should be based on all the observations of the data: 
(ii) It should be capable of further algebraic treatment. 
(iv) It should be unaffected by extreme observations. 
(v) It should be easy to calculate and simple to follow. 
(vi) It should be least affected by fluctuations of sampling. 
3.4. TYPES OF AVERAGES 
The following averages are usually used: 
(1) ArithmeticMean. (2) Median, Quartiles and other partition values. 
(3) Mode. (4) Geometric Mean. 
(65) Harmonic Mean 
8.5, ARITHMETIC MEAN 
The arithmetic mean of a set of n observations is defined as the total of all the 
observations divided by the number of observations. The arithmetic mean of sample 


data is denoted by X and the symbol 1 (mue) is used for mean of population data. 
3.5.1. INDIVIDUAL OBSERVATIONS ( UNGROUPED DATA ) 
Let the variable X takes the values, X,, X,, X;, ..., X,, then the arithmetic 


mean, denoted by = (read "X bar") is defined by 
XitXe+Xs+..+X_ _ ZXi 
n Pay ede 


X = 


where, X = arithmetic mean, ©X, = sum of all the items of the variable X, 
n = number of observations. 
Example 3.1. 
Find the arithmetic mean of the set of numbers 84, 91, 72, 58, 87 and 78. 
Solution: 
= _ 2X _ 84+91+72+68+87+78 480 
xT ee ee pee a ee 80, 
n 6 6 
Example 3.2. 
Suppose that we are interested in examining how much a computer system is 
being used by various departments. Compute the arithmetic mean for the number of 


uses of the computer system from the following data. 
6 
325 


[Department number | 1 [ 2 | 3 | a 
_ EX 20+ 65 + 120+° + 350 + 325 + 310 1470 


Solution: 
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Example 3.3. 


The reciprocals of the values of the variable X are 0.0500, 0.0400, 0.0200, 
0.0285, 0.0143. Find arithmetic mean of X. 


Solution; The necessary calculations are given below: 


Here, n= 5, DX = 200, =X 20, 


Example 3.4. . 


The logarithms of 10 values of X are 1.8062, 1.2304, 1.6532, 1.5798, 1.4314, 
0.7782, 1.6812, 1.0414, 1.7559, 1.5315. Calculate arithmetic mean of X values. 


Solution: The necessary calculations are given below: 


Here, n= 10,2X= 347, Cy eae Sot 34.7 


Example 3.5. 


The mean salary paid to 1000 employees of a factory was found to be Rs.180.40. 
Later on it was discovered that the wages of two employees were wrongly taken as 
297 and 165 instead of 197 and 185. Find the correct mean. 


Solution: 
X= aK or EX= nX 
n 
Incorrect X = 180.40, Incorrect EX = 1000(180.40) = 180400 
Correct EX = Incorrect EX — Wrong items + Correct items 
= 180400 - 297 ~. 165 + 197 + 185 = 180320 


Correct EX _ 180320 


- = “1000 = Rs.180.32 


Correct X 


Example 3.6. 


(a) The mean of 15 values is 10. If one more value is included, the mean becomes 
12, Find the value which is included. 


(b) The mean of n values is 8. If a new value 28 is included, the mean becomes 9. 
Find the value of n. 
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Solution: 
(X= Sa 10 or ae 10(15) = 150 


Let A denote the value which is included, then mean of 16 values is 
IX+A IX+A 

Wi) Se BRE re 12 or EX+A = 12(16)= 192 or 

A = 192-XX= 192-150= 42 

Hence, the included value is 42. 

=n 5 


(b) X= Siiecs or IX = 8n ; 
When 28 is included, the total of (n + 1) values becomes EX + 28. 
Thus, mean of n + 1 values = 2+ t8 = 9 or IX+ 28=9(n+1)= 9n+9 
or <EX= 9n+9-28 = 9n-19 
8n= 9n-19 (since LX = 8n) 


or 9n-8n=19 or n= 19 
3.5.2. FREQUENCY DISTRIBUTION ( DISCRETE DATA) 


If f,, f,, {5, «.., £, be the frequencies of X,, X,, X;, ..., X, respectively, then the 
arithmetic mean is defined by 


fiXi+ fXe+fXe+...+fXn FAX EAR] 
fithtit. +h 9 Sh in 
where Zf,X; = Sum of the products of the frequencies and the variable X. 
n = f,+f£,+f+...+£,= Zf = Total frequency. 
Example 3.7. 


Given below is the frequency distribution of number of apples on 100 apple 
trees, Find the average number of apples per tree. 


Number of apples (X) | 500 | 
Number of trees (f ) }15 | 


Solution: 
The necessary calculations are given below: 


[00 | 880 | 

ee 
de 2x ere 
= 101 


X = 


iT] 


hc «gay 3 
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3.5.3, GROUPED DATA 

Suppose the observations are put into n different classes and the mid points of 
the classes are calculated which represent all the observations of the respective 
class. If the mid points of the classes are X,, X,, X;, ..., X, with corresponding 


frequencies f,, f,, f,, ..-, f,, then the arithmetic mean X is defined as 


x = PMLA AM. + foXet at fy 2X 
. fithtit+..th Sf 
Example 3.8, 


The following frequency distribution shows the hourly income of 100 
households in a locality. 


Income ( Rs.) | 35-39 | 40-44 | 45-49 | 50-54 | 55-59 | 60-64 | 65-69 
Frequency | 13 15 28 17 12 10 5 


Calculate the arithmetic mean and show that sum of the deviations of values 
from their mean is zero. 


Solution: 
The necessary calculations are given below: 


X = us = Son = 49,5.Hence Yf(X-X)=0 


3.5.4. WEIGHTED ARITHMETIC MEAN 

Sometimes different observations do not have the same importance. Some 
observations, for some reason, have greater importance. The relative importance 
called the weight (W) of the observations is thus determined. If the observations X, 
X,, Xy, .-, X, have the respective weights W, W. Wy --, Wy the weighted 
arithmetic mean denoted by X, is defined as 

= _ WiXit+ WeX2+ WsXst...+WnXn _ ZLWX 

Xy = Wit W2+ Ws+...+ Wn DSW: 


6 
t 
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Example 3.9. 

A student's final marks in Mathematics, Physics, English and Statistics are 
respectively 82, 86, 90 and 70. If the respective credits received for these courses are 
3, 5, 3 and 1, determine an approximate average marks, 

Solution: : 

The necessary calculations are given below: 


wef BWX esi 10 


Weighted Arithmetic Mean = Xy = =W 
3.5.5. COMBINED ARITHMETIC MEAN 


If Xi Xe Xe teak % be the arithmetic mean of k distributions with respective 


1 
iit = 84.67 or 85 


frequencies nj, nj, ng, ..., n,, the combined arithmetic mean X is defined by 
= _ mXi+noXe+nsks+... + nyXk _ EnjXj 
5G mitne+n3+...+nk es Beat 


Example 3.10. 
Find out combined mean from the following data: 


Solution: 


Here, n, = 80,n, = 60, X ead? and X, = 20. Therefore 


X= mXitmX: _ 80(12)+ 60°20) _ 2160 _ 
X= “mem =~ 80460 = 149 = 15.43 


Example 3,11 

In three separate weeks a discount store sold 10, 25 and 15 microwave ovens at 
average prices of Rs. 8000, Rs. 6000, and Rs. 6500 respectively. What is the average 
price of the ovens sold? 
Solution: 


Here ni = 10, n2=25, ns= 15, Xi = 8000, Xz = 6000 and Xs = 6500. ‘Therefore 


= MXit meXs + nsXs_ 10(8000) + 25(6000) + 15(6509) 
= TMitnmt+ns | 10+ 25+ 15 

327500 _ 
=e = 


#550 
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Example 3,12. 


(i) Average income of a group of 10 workers is Rs.49 per day. Another group with 
average income of Rs.60 per day has joined the first group. Both the groups 
have agreed to distribute their total income equally which comes to Rs.55, Find 
the number of workers in the second group. 

(ii) The average wage of 4 men is Rs.17 per week. What is the average wage of 
further 6 men if the average wage of all 10 men is Rs.20? 

(iii) The mean marks of 80 students in a certain class was 65. The mean marks of 
male students is 70 and that of female students is 50. Find the number of boys 
and the number of girls in the class. 

Solution: 

(@) Let n, be the number of workers in the second group 


- Xi + moX: 
We have, Xe niki + nats 
Here, By 12-10, X= 440, 160, X, = oben, = 7 


Substituting the values, we get 
10(49) + 60 ne 
55 = Wain or 490+ 60 n, = 55(10+n.) or 


10 + nz 
490+60n, = 550+55n, or 60n,-55n, = 550-490 or 
5n, = 60 or n, = 12 workers 
2 = miki + nzXz 
(ii) Wehave, X_ a oF 
Here, Pat oO) sn ie tar me Ores TT ran) Ue, gaa? 


Substituting the values in the above formula, we get 


(17) + 6X + 6X = 
20 = Aan + oh = 88+ Oi or 68+6X, = 20(10)= 200 or 


6X, = 200-68 = 132 or X,= 22 
Hence the average wage of 6 men is Rs.22. 


es rue mX1 + n2X2 a 
(iii) Wehave, X< = Serene ae and nj+n,=n 
Here, Xou= (65,05 =) 70uNy = 5G, n = 60), nm, =o mec? 
Substituting the values, we get 
_ 70 ni +50 ne . i 
65 = B0 or 70n,+50n, = 80(65)= 5200 «ss (1) 
nytnp= 80 | ore: (2) 


Solving equations (1) and (2), we get n, = 20 
Substituting n, = 20 in equation (2) , we get. n, + 20 = 80 or n, = 80—- 20=60 
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. 3.5.6. CHANGE OF ORIGIN 

When a certain constant, say A is added to all the observations, we get a new 
set of obsarvations. Similarly, the observations cgn be decreased when a constant is 
subtracted from all the observations. The addition or subtraction of a constant is 
called change of origin. This constant may be called arbitrary origin or working 
mean, 

Let X;—A = D,, where A is a constant and D, is the new variable, then 


X(X,-A)= =D, or ZX,-EA = =D, 
(ZA = A+A+A+... ntimes = nA) 
=X;-nA = =D, 

Dividing both sides of the equation by n, we get 


=Dj =Xi nA =D; 
git creat non nue 
55 = = =D; 
=D or X = A+D= A+ Rt 
Ef,Dj 


For any frequency distribution, XK = A + a 
i 


This relation can also be used for the calculation of X. Some people call it short 
method. If the size of the observations is very large, the computational work can be 
reduced by using the idea of change of origin. But we shall not suggest our students 


to use this method in general. Similarly if X, + A= D,, then X+A=Dor X=D-—A. 


This equation is only of academic interest and X is never calculated through this 
relation. 
Example 3.18. *. 

The wages of 5 workers in rupees are 1950, 2000, 2050, 2060, 2080. Calculate 
the arithmetic mean by using the idea of change of origin ( short cut method ) . 
Solution: : 

Let us take 2000 as the constant, then 


2080 | Total 


Po [wo | 


Thus, X = A+ 


Example 3.14. 
The deviations of the data about X = 180 are 4, 11, — 8, — 12, 7, 9, 16, 9, 13, 15. 
Calculate the arithmetic mean. 
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Solution: 
is [> To ] 
“sD 


Here, 2D = 64, n=10, A= 180, X=A+5> = 180+ 4 = 186.4 


Example 3.15, 
Given below is the grouped data of wages of 500 workers of a factory. Calculate 


the arithmetic mean by short method. 
1900 - 1950 | 1950-2000 | 2000 - 2060 | 2050 - 2100 | 2100-2150 
ae a Se a 


[Novel votes |_| 
Let us take A = 2025 as the arbitrary origin which corresponds to the 


Solution: 
maximum frequency. The necessary calculations are given below: 


Mid Point (X) 
No, of workers (f) 


= =fD = 
Thus, X = A+ yp = 2025+ 500. 2025-21.5 = 2003.5 


3.5.7. CHANGE OF SCALE 


When all the observations are divided by a certain constant or multiplied by a 
constant, the process is called change of scale. Let us take a variable u, called a 


Xx 
coded variable, where u; = me h being any constant. We have X, = hu, 


Taking summation of both sides, we have 
=X;= Thu = hzu, 
Dividing both sides by n, we get 


ZXj uj = fi 
n 7A or X= hu 


This formula can also be used for the calculation of X. Similarly if u, = hx, 


then = hX or X =U/h 
3.5.8. CHANGE OF ORIGIN AND SCALE 


When both the operations of addition (or subtraction) and multiplication (or 
division) with a constant are applied on the observations, we get a new variable Uj, 
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X,-A Xyt+A 
called the coded variable. Thus the values of u; = eae ae h(X,-A),u; = h 


and u, = h(X,+A) are the result of the operations of change of origin and scale. 


If u, = —p> thenX, = Athy, 


Applying summation on both sides 

2X, = O(Athu) = nA+hzy, 
Dividing both sides by n, we get 

=X; nA+hZuji nA 


Dui = = 
Sas si =e +h Thus X= A+hu 


This formula can be used for the calculation of X. For frequency distribution 

the relation will be 
Xk = Ath Be = Athi 

It may be noted ‘sat = there is some relation between two variables say X; and 

u, the same relation exists between their means. If 
2 See A ee ee ls aeeie 

= yy hen u, = h(X,;-A), then u = h(X—A) 
Example 3.16. 


The weights of 150 students are given below in the form of grouped data. 
Cilemiate the mean ny use of change of of origin and scale". 


P| 


Solution: 
The necessary calculations are given below: 


95 — 100 
100 — 105 
105-110 


110-115 
115 — 120 
120-125 
125 — 130 


x5 = 107.5+4.97 = 112,47 
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3.5.9. PROPERTIES OF ARITHMETIC MEAN 
Some of the mathematical properties of the arithmetic mean are: 


(i) The sum of the deviations of the values X, from their mean X is zero ie, 
=(X; x) = <X,- nX = nX-nX = 0 (ungrouped data ) 


BA(X\-X) = UfX,-KeR = Ket-Xzg= 0 (grouped data ) 


(i) The sum of the squares of the deviations of the values of a variable is least 
when the deviations are measured from their mean i.e, 


X(X-X)? s 2(X,-A)? (ungrouped data ) 


i 


 2A(X-X)? < Bf(X,-A)? (grouped data ) 
ii) If x x x, eae x be the arithmetic means of k distributions with respective 


frequencies nj, n,, n3, ..., n,, the combined mean x of the whole distribution is 
given by 
miXi + noXe + naXg +s + nuXk 


nit ne+n3+--- +nk 


xX, = 


c 


Gv): TEX), Xe hae see X, be the n observations having arithmetic mean X and if 
Y=aX+b, then Y=aX+ b, where a and b are any two numbers and a # 0. This 
is called linear transformation of X into Y. 


Example 3.17 
Let a variable X has the values as 40, 50, 60, 80 and 100. Find the mean of X 


and verify that D(X -X ) = 0. Also verify that 5(X -X <X(X-A)? if A= 60. Now 
multiply each number by 2 and subtract 50 to obtain the observations of a new 


variable Y. Find the mean of Y and show that Y = 2X - 50, 


Solution: 
The necessary calculations are given below: 
x X-X (X- Xk) X-A| (X-Ay?_ | y=2x-50 
40 — 26 676 - 20 400 30 | 
50 -16 256 -10 100 50 
60 EF | -6 36 0 0 70 
80 +14 196 +20 400 (ege10 
100 +34 1156 +40 1600 150 
DX=330_| DK-X)=0 | EK-X) = 2320 X(X-A)? = 2500 | _ EY =410 
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x = 2% 380 66 Hence Six %) x10. 
y=2t == gp U(X- K¥<E(X-A)}! 


4 and Y=2X-50 
2X - 50= 2(66)-50=82 - 
8.5.10. MERITS AND DEMERITS OF ARITHMETIC MEAN 
MERITS 
(i) It is rigidly defined. 
(ii) It is easy to calculate and simple to follow. 
(iii) It is based on all the-observations. 
(iy) It is determined for almost every kind of data. 
(vy) Itis finite and not infinite. 
(vi) Itis readily put to algebraic treatment. 
(vii) It is least affected by fluctuations of sampling. 
DEMERITS 
@ The arithmetic mean is highly affected by extreme values. 
(ii) It cannot average the ratios and percentages properly. 
(ii) It is not an appropriate average for highly skewed distributions. 
(iv) It cannot be computed if any item is missing. 
(v) The mean sometimes does not coincide with any of the observed value. 
3.6. MEDIAN 
Median is the middle most value of the n observations arranged in ascending or 
descending order of magnitude. Half of the observations are above the median and 
half are below the median. If n is odd, there is one term in the middle which is called 


median. If n is even, the average of the two central values is called median. For 
middle term, we can use the formula 


Median = Value of (224) th item 


In grouped data the number of observations is usually large, therefore we may 


n+l n . : 
use “p> or 9 for the middle term. It has become a strong convention that for 


n. « 
grouped data, gis used to get the middle term. 


Example 3.18(a). 
The wages of 5 workers in rupees are 1800, 1900, 1700, 2000 and 2200. Find 
the median. 
Solution: 
After arranging the observations in ascending order, we get 
Rs.1700, 1800, 1900, 2000, 2200 
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: pte F 5+1 ; 
Median = Value of( 2 ) thitem = Value of (4) th item 


= Value of 3rd item = 1900 
Example 3,18(b). 


The minimum temperature in Murree for the first 10 days of March was 
—1,-2, 1, 0, 3, 3, 4, 3, 2, 6. Find the median. 


Solution: 
After arranging the observations in ascending order, we get 
—2,-1, 0, 1, 2, 3, 3, 3, 4,6 


+ 10+1 
Median = Value of @ 2 ‘) thitem= Value of (2) th item 
‘ : 8 
ite t 
= Value of (5.5) thitem = Sih stem = Oth enn = 2s8 = 2.5 
3.6.1. MEDIAN FOR FREQUENCY DISTRIBUTION 
Suppose f,, f,, f,, ..., f, are the respective frequencies of the items X, X,, 


X;,...,X,. First we calculate the cumulative frequencies and then we see the median 


+1 : . 
number G 2 ) under the cumulative frequency column. The item which 
corresponds to the median number is called median. 
Example 3.19. 


Given below is the frequency distribution of number of chairs in different 
rooms of a college. Find the median. 


Number of chairs 30 52 
a 3 


Solution: = if nite x 
The necessary calculations are given below: 


Number of chairs (X) | Number of rooms (f) 


30° 3 


n+1 29+1 


2 


Median = Value of ( 2 ) th item = Value of( 


Value of (15) th item 
The cumulative frequency 15 corresponds to 40. Thus, Median = 40. 


) th item 
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Example 3.20. 
Given below is the frequency distribution of number of persons in 50 families in 
a village. Find the median as the average family size. 


[Nunbercfpenoes | a | 3 | 4 [| #0] 6] 
[Number of faniee | 6 | aie oT | 


Solution: 


The necessary calculations are given below: 


Median 


Value of #4 ) thitem = Value of Ot + 1) th item 


Value of (25.5) thitem = 5 

Hence the average size of the family is 5 persons on the basis of median as an 
average. 
3.6.2. MEDIAN FOR GROUPED DATA 

For grouped data, we find the cumulative frequencies and then we calculate 


the median number o The group which corresponds to the median number is called 


the median group. The median lies in this group. To interpolate median we use the 
formula 


Median = 1+ (2c) 

where / = lower limit of the median class 
h = size of the class interval of median class 
f = frequency of the median class 
n = sum cf the frequencies 


c cumulative frequency of the class preceding the median class. 
This formula is based on the assumption that the frequency (f) is uniformly 
spread in the median class. It is called linear interpolation of the median. 
Example 3.21. 
Find median from the following grouped data regarding heights of students in 
a college, 


Heights (in inches) | 56-58 | 58 - 60 


Number of students 25 


66 — 68 
20 
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Solution: 
‘The necessary calculations are given below: 


Heights (in inches ) | Number of students (f ) | Cumulative frequencies (c.f.) 


25 + 40 


65 + 250 
315 + 130 = 
445 + 60 
505 + 20 


Median = Value of( en item = Value of (£28) thitem = 262.5th item 


Median lies in the class 60 — 62. 
. 3 hfn_ 
Median = [+ f G ] 


where | = 60,h = 2f= 250, 2 = 2625 and c= 65 

Thus, Median = 60 + 55 (262. 5-65) =60+ sa (197.5) = 60 + 1.58 = 61.58 
Example 3.22, 

Calculate median from the following data: 


[ Groups lo-14 | 15-19 | 20-24 | 25-29 | 30 - at | 
Frequency 5 | 


Solution: 
The necessary calculations are given below: 


[cn oundanin (Cm) | & | Gamat omens 

9.6-14.5 5 5 
14.5 -— 19.5 6+12 = 
19.5 — 17+ 30= 
24.5 — 29.5 
29.5 — 84.5 


10-14 
15-19 
20 — 24 
25 = 29 
380-34 


i 


389th item 


Median = Value of( 3) thitem= Value of( % Ben item 
Median lies in the class 19.5 - 24.5. 
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h(n 
1+ (Bc) 


where / = 19.5, h = 5,f = 30, oe 39 and c= 17 


Median 


Thus, Median = 19.5 + 2 (39 -17)= 19.5+ Fen = 19.5 + 3.67 = 23.17 
3.6.3. MERITS AND DEMERITS OF MEDIAN 


MERITS 
(i) Itis easy to understand. (ii) It is readily calculated. 
(iii) Its graphic location is easy. (iv) It is not affected by extreme values. 


(v) It gives good results in a study of qualitative measurements. 
(vi) It is an appropriate average in a highly skewed distribution. 
(vii) Its value is least affected by the addition of a few more items. 
(viii) It can be calculated even if the data has open intervals at either end provided 
the median does not lie in the open intervals. 
DEMERITS 
(i) It is not capable of further mathematical treatment. For example if we know 
the medians of two sets of data, we cannot find the combined median for both 
the sets. 
(ii) It cannot give correct total when multiplied by the number of items, 
(iii) It is necessary to arrange the values in an array before finding the median. 
(iv) Median is more likely to be affected by the fluctuations of sampling than the 
arithmetic mean. 
(v) If big or small items in a series are to receive greater importance median would 
be an unsuitable average. Median ignores the values of extreme items. 
3.6.4. OTHER PARTITION VALUES 
Median divides the observations into two equal parts. Thus 50 % observations 
are less than median and 50 % are above the median. There are certain other 
partition values which divide the observations into different groups. These are 
quartiles, deciles and percentiles. 
3.6.5. QUARTILES 
There are three quartiles called first quartile, second quartile and third 
quartile, These quartiles divide the set of observations into four equal parts. The 
second quartile is equal to the median. The first quartile is also called lower quartile 
and is denoted by Q,. The third quartile is also called upper quartile and is denoted 
by Q,. The lower quartile Q, is a point which has 25 % observations less than it and 
75 % observations are above it. The upper quartile Q, is a point with 75 % 
observations below it and 25 % observations above it. 
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3.6.6. QUARTILES FOR INDIVIDUAL OBSERVATIONS 


( UNGROUPED DATA ) 
Q, = Value of @ ~ *) th item 
Q, = Value of atu thitem = Value of (es 1) th item = Median 
Q, = Value of so th item 
3.6.7, QUARTILES FOR A wacaaee DISTRIBUTION 
( DISCRETE DATA ) 
Q, = Value of(224 ‘) th item (n=2f) 
Q, = Value of ons) thitem = Value of a “ *) th item = Median 
Q, = Value of S&*4 th item 
3.6.8. QUARTILES FOR GROUPED FREQUENCY DISTRIBUTION 
h(n be 
Q, = i+ B(B-c] (n=<f) 


‘ 
Qe 1+ R(B-c) 21+ B(B-c} = Median 


hf 3n 
Qs T+ #(82-c) 


3.6.9. DECILES 
The deciles are the partition values which divide the set of observations into 
ten equal parts. There are nine deciles namely D,, D,, D,, .-., Dy. The first decile 1), 
is a point which has 10% of the observations below it. 
3.6.10. DECILES FOR INDIVIDUAL OBSERVATIONS 
( UNGROUPED DATA ) 


D, = Value of att J th item 


nh 
D, = Value of aoe) th item 


D, = Value of ath 


th item 


Value of Sen 


th item 


Dy 
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3.6.11, DECILES FOR A FREQUENCY DISTRIBUTION 

( DISCRETE DATA ) 


"a 
Value of (834) thitem (n=2f) 
Value of a+) th item 


ac) 
\ 


= 
i) 


+ 
Value of. rs Te th item 


eee 
" 


. 


Dy =+ Value of 20D item 
8.6.12, DECILES FOR GROUPED FREQUENCY DISTRIBUTION 
D, = 1+8(3--) (n= =f) 


hf 2n 
D, = 1+ (2c) 

h(3n_ 
oie 1448 ) 


ne 608-9 


3.6.18. PERCENTILES 

The percentiles are the points which divide the set of observations into ong, 
hundred equal parts. These points are denoted by P,, P,, Ps, «+, Pyg, and are called 
the first, second, third, ..., ninty ninth percentiles. The percentiles are calculated for, 
very large number of observations like workers in factories and the population in _, 
provinces or countries. The percentiles are usually calculated for grouped data. The 


first percentile denoted by P, is calculated as P, = Value of G) th item. We find 


the group in which the Too th item lies and then P, is interpolated from the formula 


h(_n h(_2n 
a 1+ Bets-9) ec t+ Foo" 9) 
99n 
P, = 1+ 3 (G3 -<) aoe Pog = 1+ 3( 22-9) (n= =f) 
It may be noted that all the partition points can be expressed in terms of 
percentiles. For example, Median = D, = Pio, Qi = Pos, Q3 = Pog, Dy = Pig, Dy = Pao, 
D; = Py and Dy = Pag. 
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Example 3.23. 
(a) Find lower quartile, median and upper quartile from the following data: 
8, 7, 12, 25, 37, 48, 15, 69, 52, 73, 70, 88, 82, 80, 92. 
(b) Compute lower and upper quartiles, 5th decile and 25th percentile from the 
given data: 


26, 22, 4, 80, 18, 11, 35, 41, 12, 32. 
Solution: 
(a) After arranging the observations in ascending order. We have 
3, 7, 12, 15, 25, 37, 48, 52, 69, 70, 73, 80, 82, 88, 92 
ti Lie hae 
Q, = Value of (® 7 ‘) thitem = Value ot( 3 ) th item 
= Value of (4) thitem = 15 


Median = Value of(* ‘) thitem =. Value of (8+ ) th item 


= Value of(8)thitem = 52 
Q, = Value of 2°) th item = Value of 45* th item 


= Value of (12) thitem = 80 
(b) After arranging the observations in ascending order, we get 
11, 12, 14, 18, 22, 26, 30, 32, 35, 41 
+1 + 
Q, = Value of eC 4 ) thitem = Value of er 1) thitem 

= Value of (2.75) th item 

= 2nd item + 0,75 ( 3rd item — 2nd item ) = 12 + 0.75 (14 — 12) 

= 124+1.5 = 13.5 


3(n +1 3 
Qs; = Value of Seen thitem = Value of aao+0 th item 


= Value of (8.25) th item 
= 8thitem + 0.25 (9th item — 8th item ) = 32 + 0.25 (35 — 32) 
= 82+0.75 = 32.75 


Ds = Value of aD thitem = Value of OED th item 


= Value of (5.5) th item 
= 5th item + 0.5 ( 6th item — 5th item ) = 22 + 0.5(26 — 22) = 22+ 2= 24 
+ 
Po, = Value of BAD th item = Value of 25 ne 1 th item 
= Value of (2.75) th item 
= 2nd item + 0.75 ( 3rd item — 2nd item )= 12 + 0.75 (14 - 12) 
= 12+1.5 = 13.5 


106 Basic Statistics Part-I 


Example 3.24. 


Eight coins were tossed together and the number of heads resulting was noted. 
The operation was repeated 256 times and the frequencies that were obtained for 
different values of X, the number of heads, are shown in the following table. 
Calculate median, quartiles, 6th decile and 27th percentile. 


Solution: 
Here the variable X is discrete and therefore X is written as 0, 1, 2, 3, ---, 8. The 


Pa [ea | 
n+1 


Median = Value of( 2 ) th item = Value of (28 th item 


necessary calculations are given below: 


= Value of (128.5) thitem = 4 


+ 56 + 1 
Q, = Value of (2 : 1) thitem = Value of (2 7) th item 
= Value of (64.25) thitem = 3 


Q = Value of S@*2 th item= Value of 3 256 + Dt item 


5 
Value of & an6 + 1 th item 


= Value of (154.2) thitem = 4 


+ 5 
Py;= Value of 2r(atD) th item = Value of = re } th item 


= Value of (69.39) thitem = 3 
Example 3.25, 


Find median, lower and upper quartiles, 5th decile and 44th percentile from 
the following observations. 


ase arses [10 | os 06 | 04 | 02 
[ie 6 9) fn 1) | 24 ies 20 elds hae 5 


= Value of (192.75) th item 
Ds = Value of Sax) th item 
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Solution: 


Here the variable X is continuous and the given values of X are the midpoints 


of the classes of grouped data. Hence we write the class boundaries. After arran 
the observations in ascending order, we have 


Class Boundaries 
0.1-0,3 5 
0.3 - 0.5 7 5+7 = 12 
0.5 - 0.7 13 12+13 = 25 
0.7 - 0.9 20 25+20 = 45 
0.9-1.1 33 45+33 = 78 
11-13 24 78+24 = 102 
13-15 15 102+ 15 = 117 
1.6-1.7 9 117+9 = 126 
1.7-1.9 6 126+6 = 132 


Value of(8) thitem = Value ot (432) thitem = 66th item 


46-9 


and ¢ = 45 


Median = 


Median lies in the class 0.9- 1.1. Median = 1 


where = h=02, f= 33, 2 = 66 


0.2 


0.9, 


Thus, Median = 


iy 
Q, = Value of a thitem= Value of 1s thitem = 33rd item 
a 4) 4 


Q= 1+ B(2-<) 


where 1= 0.7, h = 0.2, f= 20, 7 = 


Thus, Q, = 0.7+ 92 33-25)=0.7+ 95 (8) = 0.7 + 0.08 = 0.78 


Value of (22) th item = Value of 3 138 thitem = 99th item 


Q, lies in the class 0.7 — 0.9, 


33 and ¢ = 25 


Q = 


Q, lies in the class 1.1 - 1,3. Q; = 


Where / = 1.1, h = 0.2, f = 24, Sa = 99 and c= 78 


0.2 B 
Thus, Q,-= 1.1+ ‘94 (99-78) = heer 92 1) =11+0.175= 1.275 


ging 


fe 0.2 
0.9 + gg (66 — 45 ) =0.9+ 93 (21) =0.9+0.127=1 027 
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Value of 2 thitem = 66th item 


D,; = Value of( 5) th item 


h(5 
D, lies in the class 0.9-1.1. D,= J+ ie | 
n bn 
Where, / = 0.9,h = 0.2, f = 33, Jo = 66 and ¢ = 45 


Thus, D, = 0.9 + ori 3 6 66-45) =0.9+ 82 3 (21)= 0. .9 + 0.127 = 1.027 


P,, = Value of ( at thitem = Value of $4082) oy, item = 58.08th item 
h (44 
P,, lies in the class 0.9- 1.1. Py= 1+ b (tino) 
44n 
Where / = 0.9,h = 0.2, f = 33, too = 58.08 and c = 45 


Thus, P,, =0.9+ 92 (58.08 —45)=0.9+ 08 (13.08) = 0.9 + 0.079 = 0.979 


8.6.14. GRAPHIC LOCATION OF MEDIAN AND PARTITION VALUES 


Median and other partition values can be located from the graph of the 
cumulative frequency polygon ( ogive polygon ) . Suppose we have a graph of the 
cumulative frequency polygon as shown in the diagram below : 


Cumulative frequencies 


Q Median Q; 


Classes 
Fig.3.1. Graphical location of the partition values. 
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For median, we calculate n/2. On Y-axis, we mark the height equal to n/2 and 
from this point we draw a straight line parallel to X-axis which intersects the 
polygon at the point m. From the point m, we draw a perpendicular which touches 
the X-axis at M. This point on X-axis is the median. Similarly, for the lower quartile 
we take a height equal to n/4 on the Y-axis. From this point we draw a line parallel 
to X-axis which meets the pdlygon at the point q. From this point we draw a 
perpendicular on X-axis which touches it at the point Q, which is the first quartile. 
For. upper quartile take the height on Y-axis equal to 3n/4 and proceed in a similar 
way as described above to find Qs and also the other partition values. 

3.7. MODE 
Mode is that observation which occurs maximum number of times in the data, 

In some data the mode may not exist, and even if it does exist it may not be unique. 

If there are two modes, in a distribution, the distribution is called a bimodal 

distribution. 

Example 3.26. 

Identify the mode for each of the following lists of numbers: 

G) 4, 2, 6, 6, 6, 5, 9, 10, 13 and 15 

(i) 2, 3, 4, 4, 4, 5, 5, 7, 7, 7and9 

(iii) 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, and 22 

(iv) 8, 11, 4, 3, 2, 5, 10, 6, 4, 1, 10, 8, 12, 6, 5 and 7. 

Solution: 

(i) Since 6 occurs maximum number of times, therefore the mode = 6, 

(ii), There are two modes which are 4 and 7 because both items occur maximum but 
equal number of times. A distribution with two modes is called a bimodal 
distribution. 

(iii) Here, we have no mode because each item occurs the same number of times. 

(iv) Since each of the numbers 4, 5, 6, 8 and 10 occurs twice, we can consider that 
there are five modes. However, it is nore reasonable to conclude in this case 
that mode does not exist. 

3.7.1. MODE FOR A FREQUENCY DISTRIBUTION (DISCRETE DATA ) 

In a discrete series the value of the variable against which the frequency is 
maximum would be the modal value. 

Example 3.27, 


The, following are the size of shoes as worn by 10 persons. Calculate the modal 
size, 


Size of shoes 


Solution: 
Since the size 4.5 occurs the maximum number of times, therefore Mode = 4.5 
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3.7.2. MODE FOR GROUPED DATA 

In case of grouped data, the mode lies in the class which has the maximum 
frequency. This class is called the modal class. This is done when all the class 
intervals are equal. If the class intervals are not uniform, the frequencies are 
converted into the frequencies per unit interval and then the class with maximum 
frequency is used as the modal class. The mode lies within the modal class. The 
following formula is used for locating the mode 

Re (fm —f1) 0 (fm —f1) 
Mode fitter) te sea) ae Ms AS a va 


where J = _ lower limit of the modal class 
f, = maximum frequency of the modal class 
f, = frequency of the class preceding the modal class 
f, = frequency of the class following the modal class 


h = length of the class interval of modal class. 
Note 1: Sometimes two classes have equal maximum frequencies, in that case 
mode should not be calculated. 
Note 2; While calculating the value of mode it should be seen that the class 
intervals of the different classes are equal otherwise the above formula cannot be 
applied. When mode is illdefined the above formula is not applicable. In that case 
the value of mode is indirectly obtained by applying the following formula: 
Mode = 3 Median-2 Mean. 

Example 3.28. 

Compute the mode of the followin 
Classes 


Frequency 


distribution: 


Solution: 
Here the maximum frequency is 25 belonging to the class 28 — 35. So the mudal 
class is 28 — 35. Using the formula, 


ow fin —fi 
Mode’=" 0+ G@o=hyt Ge =8) 2 
Where! = 28,f, = 25, f,= 20, f,=18 and h=7 
Thus, Mode = 28+ eo x7 = 28+ >= x 7=2 = 
us, Mode = GS 2 F @STIH *7= 28+ Gey % 7 = 28+ 2.917 = 30.917 


73. MERITS AND DEM1.RITS OF MODE 
MERITS 

(i) It is easily located. 

(ii) It is not affected by the extreme values. 

(iii) It can be located in open ended classes. 

(iv) It is the most representative of the averages. 
(v) It is easily understood by a common man. 
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DEMERITS 

(i) It has no significance when number of items is not large. 

(ii) It is not based on all the observations, 

(ili) It is not capable of further mathematical treatment. 

(iv) Sometimes a distribution may have more than one mode. In this case mode 
should not be calculated. 

3.8. EMPIRICAL RELATION BETWEEN MEAN, MEDIAN AND MODE 


In a moderately skewed distribution, median lies in between the other two 
averages i.e, mean and mode. For a moderately skewed distribution there exists an 
empirical relationship among the mean, median and mode. The difference between 
mean and median is half of the distance between median and mode, The difference 
between mean and mode is three times the difference between mean and median, 
The relation between them is 


Mean—Median = ; ( Median — Mode ) 


Mean — Mode = 3( Mean — Median ) 
Median—Mode = 2Mean-2Median = 2( Mean — Median ) 


Mean = 4 (3 Median — Mode ) 


Median dy 2 Mean + Mode ) 


Mode = 3Median—2Mean 

Example 3.29. 

G) In a moderately skewed distribution, arithmetic mean = 24.6 and the mode 
= 26.1. Find the value of the median and explain the reason for the method 
employed. 

(ii) In moderately asymmetrical distribution, the value of median is 42.8 and the 
value of mode is 40. Find the mean, 

(iii) Find out the missing figure. 

Mean = ?(3 Median — Mode ) 

(iv) In a moderately asymmetrical distribution, the value of mean and median is 15 
and 16 respectively, Find out the value of mode, 

Solution: 

The necessary calculations are given below: 

(@) In a moderately asymmetrical distribution the following relationship holds 

good amongst mean, median and mode. 


Median = a 2 Mean + Mode ) 


Hence given any two averages, we can determine the third. 
Here, Mean = 24.6 and Mode = 26.1 


Thus, Median = 4 (2% 24.6+26.1) = $ (49.2 + 26.1) = 25.1 
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Gi) Mean = } (3 Median ~ Mode ) 
Here, Median = 42.8 and Mode = 40 
Thus, Mean= 3 (3% 42.840) = 3 (1284-40) = 44.2 


1 1 
(ii) Mean = 9(3 Median — Mode ). Hence the missing figure is 2 


(iv) Mode = 8 Median - 2 Mean 

Here Mean = 15 and Median = 16. Thus, Mode = 3(16) — 2(15) = 48 — 30 = 18 
3.9. GEOMETRIC MEAN 

The geometric mean 'G' of n positive values is defined as the nth root of their 
product. Thus, it is obtained by multiplying together all the n values and then 
taking the nth root of the product. Thus, 


1 
sad n 
G = [KR Kye Bye = WR. Ket Kas... Xn 
Where n-stands for the number of observations and X,, X,, X;, ..., X, are the 
various values. For instance the geometric mean 'G' of 2, 4, 8 is 
i i 
G = [2x4x8]*= [64> =4 
The above method of calculating geometric mean is satisfactory only if there 
are two or three observations. But if n is a large number, the problem of computing 
the nth root of the product of these values by simple arithmetic is a tedious work. To 
facilitate the computation of geometric mean we make use of logarithms. The above 
formula when reduced to its logarithmic form can be written as : 


logG = + [log X, + log X, + log X, + --- +logX,] = + Slog xX, 
L log X; 
or G = Antilog ( For ungrouped data ) 


The logarithm of the geometric mean is equal to the arithmetic mean of ‘the 
logarithms of individual values. The geometric mean is used when we are interested 
in averaging the ratios, for example, in the construction of index numbers, ‘the 
geometvic mean is the most suitable average for averaging the price or link relatives. 

If the n non-zero and positive variate-values X,, X,, Xy, ..., X, occur f), f; fy, «+, f, 
times respectively, then geometric mean G, is defined by 


1 
fi f fs falN 
G Xi+ Xa: Xa- + X_ where N = f,+ f+ fgt+... +f = Dfj 
logG = Lf log X, + flog X, + f, log X, +... +f, log X,] = Bf log X, 
Zfj log Xj Lj log X; 
G = Antilog Bits or G = Antilog ise. ( For grouped data ) 
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Example 3.30 
Given the following set of data from a sample of size 6: 
6, 3, 8, 4, 2, 10 
Calculate the geometric mean, 
Solution: 
The necessary calculatiois are given below: 


eee er pe 


G = Antilog (BeeX) = /Antilog (£0638) = 4.7524 


Example 3.31, 


The logarithms of 10 values of X are: 1.0792, 1.1761, 0.9542, 1.1761, 1.2041, 
1.2553, 1.3010, 1.3979, 1.3010, 1.4771. Calculate the geometric mean of X values. 


Solution: 
Here, n= 10 and Llog X, = 12.3220. Therefore 
4, ( 2lee xi) 4 (12.3220 
G = Antilog (Aes = Antilog ‘ee 
= Antilog( 1.2322 ) = 17.0687 
Example 3.32. 


The following are the reciprocals of the values of the variable X. Find the 
geometric mean of X: 0.1345, 0.0378, 1.2937, 0.4588, 0.0039. 


Solution: 
The necessary calculations are given below: 


[io [tear] 


5, ElogX = 4.9275 
Antilog (Seex) 


iT 


0.8710 
1.4226 


278 
~ 0.1135 = Antilog (422) 
9.3385 


= Antilog (0.9855) = 9:67 
2.4089 


Example 3.33, 


Find the average rate of increase in population which in the first decade 
increased by 20 %, in the next by 30 % and in the third by 40 %. 
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Solution: 
The necessary calculations are given below: 


100+ 20 = 120 2.0792 
100 + 30 2.1139 
100 + 40 2.1461 


6.3392 


G = Antilog (2¥sX) = Antilog ( sa ) = Antilog (2.1131) = 129.75 


Average rate of increase = 129.75-100 = 29.75 %. 
Example 3.24. 


Find the two numbers whose arithmetic mean is 9 and their geometric mean is 
TZ. 


Solution: 
Let the two numbers be X and Y so that 


X+ 
pa 9 and XY = 7.2 or XY = (7.2)? = 51.84 


X+¥Y = 18 and XY= 51.84 
We know that (X-Y)* = (X+Y)?-4XY 
Substituting the values, we get 


i 


(X-Y)? = (18)*-4(51.84) = 324-207.36 = 116.64 
X-Y = 11664 = 108 -..- (1) 
X+¥ 0 2] 18 LD eee (2) 


Solving equations (1) and (2) ,we get 
2X= 28.8 or X = 144 
Substituting X = 14.4 in equation (2), we get 
144+Y= 18o0r Y = 18-144 = 3.6 
Thus, the two numbers are 14.4 and 3.6. 
Example 3.35. 
Calculate geometric mean for the following distribution. 
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Solution: 


The necessary calculations are given below: 
Weights ( in lbs.) 
100 — 104 
105 - 109 
110-114 
115-119 
120-124 


2.0086 
2.0294 
2.0492 
2.0682 
2.0864 


48.2064 
60.8820 
92.2140 
134.4330 
150.2208 


tf 236 


\31, PROPERTIES OF GEOMETRIC MEAN 


(1) 
(2) 


(3) 


(4) 


The main properties of geometric mean are: 
The geometric mean is less than arithmetic moan, ic, G.M. < A.M. 


ee Ve ae 


2 
G= Antilog ( 128%.) . Antilog (85.9802) . Antilog (2.0891) = 
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The product of the items remains unchangeil if each item is replaced by the 


geometric mean. 


The geometric mean of the ratio of corresponding observations in two series is 


equal to the ratios of their geometric means. 


The geometric mean of the products of corresponding items in two series is 


equal to the product of their geometric means. 


\3.2. MERITS AND DEMERITS OF GEOMETRIC MEAN 
MERITS 


(i) 
ii) 

(iii) 
(iv) 


(v) It is not much affected by extreme values. 
DEMERITS 
(i) ‘Tt cannot be calculated if any of the observations is zero or negative. 


(ii) 
(iii) 
(iv) 


It is rigidly defined and its value is a precise figu 
It is based on all the observations 
It is capable of further algebraic treatment. 


It is not much affected by fluctuations of sampling. 


Its calculation is rather difficult. 
It is not easy to understand. 


It may not coincide with any of the observations. 
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3.10. HARMONIC MEAN 


Harmonic mean is defined as the reciprocal of the mean of the reciprocals of 
the items in a series. It is the ratio of the number of items and the sum of 


reciprocals of items. If the observations are X,, X,, X, --- , X,, then the harmonic 
mean H is 4 
H- Number of items Yi n ods 
~ Sum of reciprocals of items ~ 1 War. bys SOY is i ia 
Peo Ragy hae! ea ae Xj 


The calculation of H is not possible if any item is equal to zero. 
If the midpoints of the classes are X,, X,, X;, « , X, with corresponding 


frequencies f,, f, , f,, ««, f,, then the harmonic mean H is given by 


fit fot fot -- + $ 


pinous: eee ne 
pide Cup ee 
Example 3.36. 
Given the following set of data‘. 2 sample of size 5: 


13.2, 14.2 (4.8 15.2, 16.1 
Calculate the harmonic mean. 
Solution: 
The harmonic mean is calculated as i. : 


X(1/X) = 0.3417 
Example 3.37, 


Calculate the harmonic mean for the data given below : 


ome ee ae es 
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Solution. The necessary calculations are given below : 


|___ Marks f/X 
30-39 0.0580 
40-49 0.0674 
50-59 0.2018 
60 - 69 20 0.3101 
70-79 0.4295 

0.2959 
0.0741 
1,4368 
100 
= (4) = 1.4368 = 69.60 
x 


Example 3.38. 


The daily wages for a group of 236 persons have been obtained from a 
frequency distribution of a continuous variable X, after making the substitution 
X-112 
geese 


Calculate the harmonic mean. 
Solution: 
The necessary calculations are given below: 
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aan. MERITS AND DEMERITS OF HARMONIC MEAN 


Merits 
(i) It is based on all the observations. 
(ii) It is not much affected by the fluctuations of sampling. 
(iii) It is capable of algebraic treatment. 
(iv) It is an appropriate average for averaging ratios and rates. 
(v) It does not give much weight to the large items. 
Demerits 
(i) ‘Its calculation is difficult. 
(ii) It gives high weightage to the small items. 
(iii) It cannot be calculated if any one of the items is zero. 
(iv) It is usually a value which does not exist in the given data. 


SHORT DEFINITIONS 


Measure of Central Tendency 
A measure of central tendency is defined as a single value that is considered most 
representative of the whole set of data. 

or 
A descriptive measure tha‘ indicates the central position in a set of data is called 
measure of central tendency. 
Average 
An average is a central value which can represent the whole data. 

or 
‘The average is a value which locates the central position of the observations. 
Mean 
The mean of 2 set of data is found by adding up all the observations and dividing by 
the total number of observations. 

or 
The mean of a set of quantitative data is equal to the sum of the, measurements 
divided by the number of measurements contained in the data set. * 2 1 Bs 
Weighted Mean 
The mean for a set of data obtained by assigning each data tt a jeight that 
reflects its relative importance within the sct, is called weighted mean. 

or 
The mean of values when each value is weighted according to its relative 


importance, is called weighted mean. ; 
Median The Mih mest value 1 ol alia Daka j5 cllbrrd 
The median is the midpoint of the values after they have been ordered from the 
smallest to the largest or the largest to the smallest. I 

hs Ae ioe. in res oe te alt 
The median of a data is the middle item when the items are arranged in ascending 
or descending order. 
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Quartiles Ve wil, ws lib jn an a8 Peta 18 call 
Quartiles-arethe values of the variate that divide a set of data into four equal parts Aida 
afterarranging the observations in-ascending ordere ox 8, (lant) a. 
Deciles (M8}4on) ond Oz lin, 


Deciles are the values ofthe variate that divide a set of data into ten equal parts 
after arranging the observations in ascending order of magnitude. Dp, ) ADIs e sane Da 
Percentile 
Percentiles-a variate that divide a set of data into one hundred | 
equal parts after arranging the observations in ascending order of magnitude. p, Pa, eakey 
Vv Mode Moat nepetey Bete vale mw ate Data j4 colt Mo | e 
The mode is the measurement that deeurs with greatest frequency in a set of data. 4 
Mote Moor ese Men Lit Hotde nofepede value in dle Detasthen Ihe 
The value of the variable wath the largest frequency is called the mode. 
, Symmetric Data *t ile 
A set of data whose values are evenly spread around the centre is known as 
symmetric data. For symmetric data, the meanand the median are equal. ‘ 
_-Skewed Data 
A set of data that is not symmetrical is skewed data. For skewed data, the mean will 
be larger or smaller than the median.» _ 
~ Left-Skewed Data 

A data distribution is left skewed if the mean for the data is smaller than the 
median. 

\—Right-Skewed Data 
A data distribution is right skewed if the mean for the data is larger than the 
median. : oh 
Geometric Mean jht ndh Mow d Ae (Bod net of ALE nfl, Peoitve Yale’ 
The geometric mean of a set of n positive numbers is the nth root of their product., 

13 colleg Crsoméry ic Mow orits fubedla Lp-M abe. %% oJ % 
The geometric mean of non-zero and positive values is obtained by multiplying/all 
the values and then extracting the relevant root of the product. 

Harmonic Means B3 
Harmonic mean of a series js the reciprocal of the arithmetic mean of the reciprocals 
of the values 


~ 


: Syor 
The harmonic mean is condngted by averaging the reciprocals of the values of the 
wariable and finding the reciprocal of this average. 
Relative Position of Averages 
@) Inasymmetrical distribution, the mean, median and mode coincide. 4 
(ii) In a moderately asymmetrical distribution, the mean, median and mode are 
\related as mean — mode = 3 (medn — median) 

(it) If all the values in a variable are the same, then A.M. = G.M. = H.M. But if the 
values vary and none of values is zero or negative, then A.M. > G.M. > H.M. 
cm) The geometric mean of any two'values is equal to the geometric mean of their 

arithmetic mean and harmonic mean. 
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MULTIPLE - CHOICE QUESTIONS 


1, _ Any measure indicating the centre of a set of data, arranged in an increasing or 
* decreasing order of magnitude, is called a measure of: 


(a) skewness 4 (b) symmetry 
(o central tendency .. (d) dispersion 
2. Scores that differ greatly from the measures of central tendency are called: 
(a) raw scores (b) the best scores 
; (c)e" extreme scores (d)  z-scores 
3. The measure of central tendency listed below is: 
(a) the raw score (b)e" the mean 
(c) the range (d) standard deviation 
4, The total of all the observations divided by the number of observations is 
called: 
(a) arithmetic mean (b) geometric mean 
(c) median (d) harmonic mean 


5. While computing the arithmetic mean of a frequency distribution, the each 
value of a class is considered equal to: 


(a) class mark 6) lower limit 
(c) upper limit (d) lower class boundary 
6. Change of origin and scale is used for calculation of the: 
a)” arithmetic mean (b) geometric mean 
(ce) weighted mean (d) lower and upper quartiles 
7. The sample mean Xisa: 
(a) parameter WH Statistic 
(c) variable (d) constant 
8. The population mean p is called: 
(a) discrete variable (b) continuous variable 
parameter (d) sampling unit 
9. The arithmetic mean is highly affected by: 
(a) moderate values (b) extremely small values 
(c) odd values or xtremely large values 
10. The sample mean X is calculated by the formula J 
DfX 2D 
(a) vf (b) A+ Tt 
af 
© At Spxh (WY all of the above 
11. Ifa constant value is added to every observation of data, then arithmetic mean 
is obtained by: 
(a) subtracting the constant oY adding the constant 


(c) multiplying the constant (d)_ dividing the constant 
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12, Which of the following statements is always true? 
(a) The mean has an effect on extreme scores 
(b) The median has an effect on extreme scores 
(c) Extreme scores have an effect on the mean 


(d) Extreme scores have,an effect on the median . 
13. The elimination of extreme scores at the bottom of the set has the effect of: 
(a) lowering the mean (by raising the mean 
(c) no effect (d) none of the above 
14. The elimination of extreme scores at the top of the sct has the effect oft 
8) lowering the mean (b) raising the mean 
(c) no effect (d) difficult to tell 
15, The sum of the deviations takén from mean is : 
always equal to zero. (b) some times equal to zero 
(c) never equal to zero. (da) less than zero. 
16. If X= 25, which of the following will b. minimum: Ny 
(a) =(X-27)? Oy X(X - 252 
©) s(X—22)2, (@) 3(X +252 i 
17. The sum of the squares of the deviations about mean is 
(a) zero (b) maximum 
(} minimum (d) all of the above 


10 
18. If L (Xi:-50) = 100, then sample mean X will be: 


<=) 
(a) 10 (b) 50 
@ 60 (d) 100 
19. For a certain distribution, if £( X - 20) = 25, X(X- 25) = 0, and XX -: 
then X is equal to: 
(a) 20 ' or 25 
(ce) -25 (d) 35 
20. The sum of the squares of the deviations of the values of a variable is least 
when the deviations are measured from: 
(a) harmonic mean (b) geometric mean 
(c) median ~ (@* arithmetic mean 


21. If X.,X2,Xs,... Xa, be n observations having arithmetic mean XandifY=4X+2, 
then Y is equal to: 


(a) 4X (b) 4X _ 


(QA 4k +2 @) 442 
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22 IfX= 100 and Y=2X- 200, then mean of Y values will be: 


¢ 0 (b) 2 
(c) 100 (d) 200 

23. Step deviation method or coding method is used for computation of the: 
( arithmetic mean 3 (b) geometric mean 


(c) weighted mean its (d) harmonic mean 
24, Ifthe aay ee mean of 20 values is 10, then sum of these 20 values is: 


(a) (b) 20 
( 2 (a) 20+10 

25. Ten families have an average of 2 boys. How many boys do they have together? 
(a) 2 (b) 10 


(©) 12 (@ 20 x 
26. Ifthe arithmetic mean of the two numbers X: and X2 is 5 if Xi = 3, then X: is: 
(a) , 3 (b) 5 
of ri if (d@) 10 
27, Given X:= 20 and X2=- 20. The arithmetic mean will be: 
ow zero (b) infinity 
(c) impossible (d) difficult to tell 
28. The mean of 10 observations is 10. All the observations are increased by 10%. 
The mean of increased observations will be: 


(a) 10 (b) 1.1 
() 10.1 (@ 11 

29. The frequency distribution of the hourly wage rate of 60 employees of a paper 
mill is as follows: 


Number of Worker 


The mean wage rate is: 
(a) Rs. 58.60 
(c) Rs. 57.60 


30, The sample mean X of first n natural numbers is: 
(a) n(n + 1)/2 ~. (b) (n - 1)/2 
(c) n/2 wo (n + 1)/2 
41. The mean of first 2n natural numbers is: 


+1 2: 
(a) mpedt ©) n zal! 
2n+1 cael 
wat @ 5 ‘ 
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32. The sum of deviations is zero when deviations are taken from: 


(ay mean, (b) median. 
(c) mode. (d) geometric mean 
33. When the values in a series are not of equal importance, we calculate the: 
(a) arithmeticmean ~ (b) geometric mean 
(c) weighted mean (d) mode 


34, When all the values in a series occur the equal number of times, then it is not 
possible to calculate the: 
(a) arithmetic mean (b) geometric mean 
(c) harmonic mean (ad) weighted mean 

35. The mean for a sct of data obtained by assigning each data value a weight that 
reflects its relative importance within the set, is called: 


(a) geometric mean (b) harmonic mean 
(c) weighted mean (d) combined mean 


36. If Xi, X, Rae, Xx be the arithmetic means of k distributions with respective 


frequencies mi, nz, n3,...,nk, then the mean of the whole distribution Xe is given 


by: 
(a) EX/En (b) Yn/DX 
() EnX/En @) X(n+X)/En 
37. The combined arithmetic mean is calculated by the formula: 
(a) ny + ng b) 2 
mxXit ny X2 a nyX1 + nyX2 
© ny +ng (a) 2 


38. The arithmetic mean of 10 items is 4 and the arithmetic mean of 5 items is 10. 
The combined arithmetic mean is: 
(a) 4 (b) 5 
() 6 (d) 90 

39. The midpoint of the values after they have been ordered from the smallest to 
the largest or the largest to the smallest is called: 


(a) mean (b) median 
(c) lower quartile (d) upper quartile 
40. The first step in calculating the median of a discrete variable is to determine 
the: 
(a) cumulative frequencies (b) relative weights 


(c) relative frequencies (d) array 
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47. 


48. 


49. 


50. 


The suitable average for qualitative data is: 

(a) mean (b) median be 

(c) mode (a) geometric mean 

Extreme scores will have the following effect on the median of an examination: 

(a) they may have no effect on it (b) they may tend to raise it 

(c) they may tend to lower it (d) none of the above 

We must arrange the data before calculating: 

(a) mean, (b) median. 

(c) mode. (d) geometric mean 

If the smallest observation in a data is decreased, the average which is not 

affected is: 

(a) mode (b) median. 

(c) mean. (d) harmonic mean. 

If the data contains an extreme value, the suitable average is: 

(a) mean. (b) median. 

(c) weighted mean. (d) mode. 

Sum of absolute deviations of the values is least when deviations are taken 

from: 

(a) mean (b) mode 

(c) median (d) Qs 

The frequency distribution of the hourly wage rate of 100 employees of a paper 

mill is as follows: 
Wage Rate(Rs.) | 54-56 | 56-58 60-62 | 62-64 
Number of Workers 20 20 2 

The median wage rate is: 

(a) Rs. 55 (b) Rs. 57 

(c) Rs. 56 (d) Rs. 59 

The values of the variate that divide a set of data into four equal parts after 

arranging the observations in ascending order of magnitude are called: 

(a) quartiles (b) deciles 

(c) percentiles (a) difficult to tell 

The lower and upper quartiles of a symmetrical distribution are 40 and 60 

respectively. The value of median is: 

(a) 40 (b) 50 

(c) 60 (d) (60-40) /2 

Ifin a discrete series 75% values are less than 30, then: 

(a) Qs<75 (b) Qs<30 


(ce) Qs=30 (2 Qs > 30 
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51. 


52. 


53, 


54, 


56. 


57. 


58. 


59. 


60. 


61. 


If in a discrete series 75% values are greater than 50, then: 


(a) Qi:=50 (b+) Q:<50 

() Qi>50 (d) Qi: #50 

If in a discrete series 25% values are greater than 75, then: 
(a) Qi>75 “4 (b) Qi =75 

(©) Qs=75 (d) Qs>75 

If in a discrete series 40% values are less than 40, then: 

(a) Ds¥ 40 (b) Di < 40 

(c) Da>40 (d) Dsa=40 

If in a diserete series 15% values are greater than 70, then 
(a) Pis=70 (b) Pes = 15 

(c) Pas = 70 (d) Pw =70 

The middle value of an ordered series is called: 

(a) median (b) 5t decile 

(c) 50* percentile (d) all the above 
Ifin a discrete series 50% values are less than 50, then: 

(a) Q2=50 (b) Ds=50 

(c) Pso=50 (a) all of the above 


The mode or modal value of the distribution is that value of the variate for 
which frequency is: 


(a) minimum (b) maximum 

(c) odd number (d) even number 

Suitable average for averaging the shoe sizes for children is: 

(a) mean. (b) mode. 

(c) median. (a) geometric mean. 

Extreme scores on an examination have the following effect on the mode: 
(a) they tend to raise it (b) they tend to lower it 

(c) they have no effect on i: (d) difficult to tell 

A measurement that corresponds to largest frequency in a set of data is called: 
(a) mean (b) median 

(c) mode (a) percentile 


Which of the following average cannot be calculated for the observations 
2,2,4,4,6,6,8,8, 10,10? 


(a) mean (b) median 

(c) mode d) all of the above 
Mode of the series 0,0,0,2,2,3,3,8, LO is: 

(a) 0 ye 2 


12 eats! (1) no mode 
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63. 


64. 


65. 


66. 


67. 


68. 


69. 


70. 


71. 


72. 
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A distribution with two modes is called: 


(a) unimodal (b) bimodal 

(c) multimodal (d) normal 

The modal letter of the word “STATISTICS” is: 

(a) S * (b) T 

(c) bothS andI (d) bothS and T 

The mode for the following frequency distribution is: % 
Weekly sales of burner units | 1 2 3 Over 3 
Number of weeks [ [ 6 5 1 0 

(a) 50 (b) 38 

(ce) 0 (d) no mode 


Which of the following statements is always correct: 

(a) Mean = Median = Mode 

(b) Arithmetic mean = Geometric mean = Harmonic mean 
(c) Median = Qz= Ds = Pso 

(d) Mode = 2 Median -3 Mean 

In a moderately asymmetrical series, the arithmetic mean, median and mode 
are related as: 

(a) mean — mode = 3 ( mean — median ) 

(b) mean — median = 2 ( median — mode ) 

(c) median — mode = ( mean — median )/2 

(d) mode —- median = 2 mean — 2 median 

In a moderately skewed distribution, mean is equal to: 


(a) ( 3median — mode )/2 (b) ( 2mean + mode )/3 

(c) 3 median - 2 mean (d) 3 median — mode 

In a moderately asymmetrical distribution, the value of median is given by: 
(a) 3 median + 2 mean (b) 2mean+ mode 

(c) (2 mean + mode )/3 (d) (3 median - mode )/2 

For moderately skewed distribution, the value of mode is calculated as: 

(a) 2mean—-3 median (b) 8 median -2 mean 

(c) 2mean + mode (d) 3 median - mode 


In a moderately skewed distribution, mean = 45 and median = 80, then the 
value of mode is: 

(a) 0 (b) 30 

() 45 (a) 180 

If for any frequency distribution, the median is 10 and the mode is 30, then 
approximate value of mean is equal to: 

(a) 0 (b) 10 

(c) 30 fd) 60 
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73. 


74, 


75. 


76. 


77, 


78. 


79. 


80. 


81. 


82, 


83. 


In a moderately asymmetrical distribution, the value of mean and mode is 15 
and 18 respectively. The value of median will be: 
(a) 48 (b) 18 

(©) 16 (d) 15 
median — mode 


For moderately skewed distribution ohn mean is equal to: 

(a) 2 (b) 3 

(c) 1/2 (d) 1/3 

Which of the following is correct in a positively skewed distribution? 
(a) Mean = Median = Mode (b) Mean < Median < Mode 
(c) Mean > Median > Mode (d) Mean + Median + Mode 


if the values of mean, median and mode coincide in a unimodal distribution, 
then the distribution will be: 


(a) skewed to the left (b) skewed to the right 
(c) multimodal (d) symmetrical 
Accurve that tails off to the right end, is called: 

(a) positively skewed (b) negatively skewed 
(c) symmetrical (d) both (b) and (c) 


In a unimodal symmetrical distribution the highest point on the curve is called 
the: 

(a) mean (b) median 

(c) mode (d) all of the above 

If a set of data has one mode and its value is less than mean, then the 
distribution is called: 


(a) positively skewed (b) negatively skewed 

(c) symmetrical (d) normal 

Taking the relevant root of the product of all non-zero and positive valucs is 
called: 


(a) arithmetic mean (b) geometric mean 

(c) harmonic mean (d) combined mean 

The best average in percentage rates and ratios is: 

(a) arithmetic mean (b) lower and upper quartiles 
(c) geometric mean (d) harmonic mean 


The suitable average for computing average percentage increase in population 
is: 
(a) geometric mean (b) harmonic mean 


(c) combined mean (d) population mean 

If 10% is added to each value of variable, the geometric mean of new variable is 
added by: 

(a) 10 (b) 1/100 


(c) 10% (d) 1.1 
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86. 


87. 


88. 


89. 


90. 


91. 


92. 


93. 


94, 
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If each observation of a variable X is increased by 20%, then geometric mean is 
also increased by: 


(a) 20 (b) 1/20 

(c) 20% (d) 100% 

If any value in a series is negative, then we cannot calculate the: 
(a) mean (b) median 

(c) geometric mean (d) harmonic mean 


Geometric mean for X, and X, is: 
(a) VXi+X2 (b)  VXiX2 
© Vxi+VX (a) V2XiX2 


Geometric mean of 2, 4, 8 is: 


(a) 6 (b) 4 

(ce) 14/3 (@) 8 

Geometric mean is suitable when the values are given as: 
(a) proportions (b) ratios 

(c) percentage rates (d) all of the above 


If the geometric mean of the two numbers Xi and Xz is 9 if Xi = 3, then X2 is 
equal to: 


(a) 3 () 9 

() 27 (@ 81 

If the two observations are a = 2 and b = — 2, then their geometric mean will be: 
(a) zero (b) infinity 

(c) impossible (d) negative 

Geometric mean of — 4, — 2 and 8 is: 

(a) 4 (b) 0 

(c) -2 (d) impossible. 


The ratio among the number of items and the sum of reciprocals of items, is 
called: 


(a) arithmetic mean (b) geometric mean 
(c) harmonic mean (a) mode 
Harmonic mean for X, and X; is: 
2 2XiX2 

(@) 4 Xs : CODE Nicexs 

Xi + X2 Xi + X2 
WN Tee (@) “OXiX 
The appropriate average for calculating the average speed of a journey is: 
(a) median (b) arithmetic mean 


(c) mode (d) harmonic mean 
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95, 


96, 


97. 


98, 


99. 


100. 


101, 


102, 


103. 


104. 


105. 


106. 


Harmonic mean gives less weightage to: 


(a) small values (b) large values 

(c) positive values (d) negative values 

The harmonic mean of the values 5, 9, 11, 0, 17, 13 is: 

(a) 9.5 ‘ (b) 6.2 

fe) 0 (d) impossible. 

If the harmonic mean of the two numbers ¥) and Xz is 6.4 if Xz = 16, then X: is: 
(a) 4 (b) 10 

() 16 (d) 20 

If a = 5 and b = - 5, then their harmonic mean is: 

(a) -5 (b) 5 

(c) 0 (d) 2 

For an open-end frequency distribution, it is not possible to find: 
(a) arithmetic mean (b) geometric mean 

(c) harmonic mean (d) all of the above 

If all the items in a variable are non zero and non negative then: 
(a) AM>G.M>H.M (b) GM>AM>H.M 
(c) HM>GM>AM (dq) AM<GM<HM 


The geometric mean of a set of positive numbers Xi,X2,X3,...,Xn is less than or 
equal to their arithmetic mean but is greater than or equal to their: 


(a) harmonic mean (b) median 

(c) mode (d) lower and upper quartiles 
Geometric mean and harmonic mean for the values 3, — 11, 0, 63, - 14, 100 are: 
(a) Oand3 (b) 38and-8 

(c) Oandod (da) impossible 


If the arithmetic mean and harmonic mean of two positive numbers are 16 and 
4, then their geometric mean will be: 

(a) 4 (b) 8 

(©) 16 (d) 64 

The arithmetic mean and geometric mean of two observations are 16 and 8 
respectively, then harmonic mean of these two observations is: 

(a) 4 (b) 8 

(c) 16 (d) 32 

The geometric mean and harmonic mean of two values are 8 and 4 respectively, 
then arithmetic mean of values is: 

(a) 4 (b) 16 

(c) 24 (d) 128 

Which pair of averages cannot be calculated when one of numbers in the series 
is zero? 

(a) Geometric mean and median (b) Harmonic mean and mode 

(©) Simple mean and weighted mean () Geometric mean and harmonic mean 
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107. Ina given data the average which has the least value is: 


(a) mean. (b) median. 

(c) harmonic mean. (d) geometric mean 
108. If all the values in a series are same, then: 

(a) AM=G.M=H.M + (b) AM #GM+H.M 

(c) AM>G.M>ILM (d) AM<GM<HM 
109. The averages are effected by change of: 

(a) origin (b) scale 

(c) both (a) and (b) (d) none of the above 


Answers 


(a) | 5. (b) 


57. __(b) | 58. (b) | 59. ©) | 60. (© | 61. (c) | 62. (a) | G3. (h) | 64. (dd) | 
65. (c) | 66. (c) | 67. (a) | 68. (a)| 69. (c)| 70. (b) | 71. (a) | 72. (a) 
73. (c) | 74. (a) | 75. (c) | 76. (d)| 77. (a)| 78. (d) | 79. (a) | 80. (b) 
81. (c) | 82. (a) | 83. ©) | 84. © | 85. (©) | 86. (b) it 87. (b) | 88. (d) 

89. (c) | 90. (©) | 91. (ad) | 92. (c) | 93. (b) | 94. (d) | 95. (b) | 96. (d) 
97. (a) | 98. (d)| 99. (d) | 100 (a) | 101 (a) | 102 (d) | 103 (b) | 104 (a) 
105 (b) | 106 (d) | 107 (c) | 108 (a | 109 (c) 


SHORT QUESTIONS 
Q.1 What is meant by measures of central tendency? 


Q.2 What are the characteristics of a good measure of central tendency? 
Q.3 What is a statistical average? 
Q.4 Why do we calculate averages? 
Q.5 What are the desirable properties for an average? 
or 


Write down the qualities of a good average. 
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Qe 
Q7 
Q.8 


Q.9 
Q.10 
Q.11 
Qu. 
Q.13 
Q.14 
Q.15 
Q,16 


Q.17 
Ans. 


Q.18 
Ans. 


Q.19 


Ans. 
Q.20 


Ans. 
Q.21 


Ans. 
Q.22 


Ans. 
Q.23 


Ans. 


Why an average is called measure of central tendency? 
Define the importance of average in practical life. 
Name the important types of averages. 
os or 
Write down the different measures of averages. 
Describe the uses of various averages in practical life. 
Define arithmetic mean. 
Define the population mean, 
Define the sample mean. 
Write down the various methods of calculating arithmetic mean, 
Write down the mathematical properties of arithmetic mean, 
Write down the advantages of arithmetic mean. 
Write down the demerits of arithmetic mean, 
Given £(X - 10) = 2.8 andn = 6. Find the sample mean, 
10.56 
If the sum of deviations from X = 15 for 10 values is 25, find the mean. 
17.5 


The daily wages of 5 workers are Rs. 200, 300, 400, 500, 600 and the mean of 
these wages is Rs. 400. Find the mean after increasing the wages by 10%. 


Rs. 440 


The mean of 10 values is 15. If one more value is included, the mean becomes 
18. Find the included value. 


48 

The mean of 3 numbers is 20. If one more number is included, the mean 
becomes 25. Find the included number. 

40 

The mean of 5 observations is 60. Another item is included in the 
observations and now the mean becomes 62. Find the included item. 

72 

A student calculated the value of mean as 20 from 25 observations. It was 
later discovered at the time of checking that he had copied down two values 


as 7 and 18 while the correct values where 13 and 17. Find the correct value 
of the mean. 


20.2 
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Q.24 
Ans. 


Q.25 


Ans. 
Q.26 


Ans. 
Q.27 


Ans. 
Q.28 


Ans. 


Q.29 


Ans. 
Q.30 


Ans, 
Q.31 


Q.32 
Ans. 


Q.33. 


Q.34 
Q.35 


Q.36 


Ans. 
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The mean of n values is 10. If a new value 20 is excluded, the mean becomes 
8. Find the value of n. 
6 


Given u= a , fu = 100 and Ef = 200. Find arithmetic mean. 


172.5 
The average IQ of 10 students in a Statistics course is 114. If 9 of the 
students have IQs of 101, 125, 118, 128, 106, 115, 99, 118 and 109, what must 
be the IQ of 10th student? 

121 


A given stock was purchased at the following prices at various times. 20 
shares at Rs, 8.20 a share, 100 shares at Rs, 10.90 a share, 50 shares at 
Rs. 9.40 a share, 200 shares at Rs. 7.80 a share. Find the mean cost per 
share. 

8.88 


Show that the sum of deviations from mean is zero. 


E(Xi-K)=0. 
i=1 
Verify that 3 f (Xi-X)=0. 


Efi (&-)=0. 

iz 

For a certain distribution D(X - 21) = 18 and 2( X - 28) =- 24. 
Find the value of n and X. 


n=6andX= 24. 

Write short note on weighted arithmetic mean. or Define weighted mean. 
Given X = 1, 2, 3, ---, nand W = 1, 2, 3, ---, n. Find the weighted mean. 
(2n+1)/3 


Differentiate between simple arithmetic mean and weighted arithmetic 
mean. 


Define combined arithmetic mean. 
What is the formula of combined arithmetic mean? 


Given X, = 15, X, = 25, X, = 35 and X, = 5, Each mean is based on the same 


number of observations, find the overall mean. 
20 
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Q.37 


Given Xi = 2.53, x, = 1,97, xe = 4.05, X, = 3.33 and each mean is based on 


eight values. Compute Xz. 
2,97 


The arithmetic mean of 100 items is 5 and the arithmetic mean of 150 items 
is 4.8, Find combined mean. 


4.88 


Average weight of 10 students is 60 kg. and average weight of 15 students is 
64 kg. Find the average weight of all the 25 students. 


62.4 ke. 


The mean wage of 100 workers working in a factory running two shifts of 70 
and 30 workers respectively is Rs. 84. The mean wage of 70 workers working 
in morning shift is Rs. 90. Find the mean wage of 30 workers working in the 
evening shift. 

Rs. 70 


Define the term median. 

Enlist any four advantages of median. 

Write down any four demerits of median, 

Given / = 60, h= 10, f= 20, n=80andc= 30. Find median. 
65 

Detine the quartiles. Also write down its formulas. 

Define the deciles and explain how to coinpute. 

Write a short note on percentiles. 

Can all quartiles be expressed as percentiles? Explain. 

Qu = Pas, Q2 = Pso, Qs = Prs 

Define mode. or Describe the definition of mode. 

Give important advantages and disadvantages of the mode. 
Given 1 = 200, f,, = 25, f, = 20, f, = 20 and h = 10, Find mode. 
205 

Write down the empirical relationship be‘ n mean, median and mode, 


What is the relative position of the sucan, median and mode in the 
symmetrical and skewed distributions? 


Write the formulas of mean, median and mode in case of a frequency 
distribution. 
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Ans, 
Q.56 


Ans. 
Q.57 


Ans. 
Q.58 


Ans. 
Q.59 
Q.60 
Q.61 
Q.62 
Q.63 


Ans. 
Q.64 


Ans. 


Q.65 


Ans. 
Q.66 
Q.67 
Q.68 
Q.69 
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In a moderately skewed distribution, the value of the mean and median are 
120 and 110 respectively. Find the value of the mode. 


90 


In a moderately skewed distribution, mean = 25 and mode = 28. Find the 
value of the median. * 


26 


In moderately asymmetrical distribution, the value of median is 42 and the 
value of mode is 40. Find the mean. 


43 
Find out the missing figure: 


(a) Mean= ; (? median ~ mode ) 

(b) Median = ? (2 mean + mode ) 

(c) Mode = 8 median -? mean 

(a) 3 (b) 1/3 (c) 2 

Define geometric mean. 

Write down the properties of geometric mean. 
Write down any four advantages of geometric mean. 
Give any four demerits of geometric mean. 


The geometric mean of a series of 4 items is 10.2, find the product of all the 
items. 


10824.3216 


The geometric mean of two positive numbers is 10. By including the third 
number, the geometric mean becomes 8. What is the third number? 


5.12 


The geometric mean of a series of 3 values is 4, By including the 4th value, 
the geometric mean becomes 2. What is the 4th value? 
0.25 


Write short note on harmonic mean. 
Write down the advantages and disadvantages of harmonic mean. 
Explain how to compute geometric mean and harmonic mean. 


Explain the geometric mean and write down the relationship between 
arithmetic mean, geometric mean and harmonic mean. 
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Q.70 Write the formulas of arithmetic mean, geometric mean and harmonic mean 
of a frequency distribution. 


Q.71 Given X, = 4, X, = 4 and X, = 4, Show that A.M. = G.M. = H.M. 
Ans. A.M. =G.M.=H.M. =4 

Q.72 IfX, = 2 and X, = 8, show that A.M. > G.M. > H.M. 

Ans. 5>4>3,2 


Q.73 Show that the geometric mean of any two values is equal to the geometric 
mean of their arithmetic mean and harmonic mean. 


Q.74 The geometric mean and harmonic mean of two values are 20 and 16 
respectively. Find arithmetic mean of these values. 


Ans. 25 


Q.75 The arithmetic mean and harmonic mean of two positive numbers are 25 and 
16 respectively. Find their geometric mean. 


Ans. 20 


Q.76 The arithmetic mean and geometric mean of two observations are 25 and 20 
respectively. Find harmonic mean of these two observations. 


Ans. 16 


Q.77 A student computed the values of arithmetic mean, geometric mean and 
harmonic mean of a continuous variable as 40,32 and 50. Identify the values 
of arithmetic mean, geometric mean and harmonic mean. 


Ans. A.M. =50, G.M. = 40, H.M. = 30 
Q.78 Given Xi = 4 and Xz = 16. Show that G.M =\VA.M x H.M. 
Ans. G.M=VA.MxH.M=8 


Q.79 Given Xi = 3 and X2 = 27. Show that H.M = (G.M.)?/ A.M 
Ans. H.M.=(G.M.)?/ A.M = 5.4 


Q.80 Write down the relative position of averages. 


Q.81 Write down the formulas for different averages. 
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EXERCISES 


Q.1 Find the mean temperature of the following temperatures recorded in an 
experiment in the laboratory : 81, 94, 64, 80, 75, 69, 96, G6, 80, 91, 85, 79. 

Ans. 80 

2. The following table gives the hourly income of ten operators in a machine tool 
factory. Find the arithmetic mean. 


Name of operator 


Income ( Rs.) 


Q.3 (i) Compute the mean of the following sample values: 
1.3, 7.0, 3.6, 4.1, 5.0 
Show that sum of the deviations of values from their mean is zero. 
(ii) A variable Y is determined from a variable X by the equation Y = 10 - 4X. 


Find Y when X = - 3, - 2, - 1, 0, 1, 2, 3, 4, 5 and show that ¥ = 10 - 4X, 


Ans:(i) 4.2 (i)X=1, Y=6 

Q.4 The reciprocals of 11 values of X are given below. Calculate the arithmetic 
mean of X : 0.0500, 0.0454, 0.0400, 0.0333, 0.0285, 0.0232, 0.0213, 0.0200, 
0.0182, 0.0151, 0.0143. - 

Ans. 42.091 

Q.5 The logarithms of 10 values of X are: 1.8062, 1.2304, 1.6532, 1.5798, 1.4314, 
0.7782, 1.6812, 1.0414, 1.7559, 1.5315. Calculate arithmetic mean of X values. 

Ans. 34.7 

Q.6 A student calculated the value of mean as 20 from 25 observations. It was later 
discovered at the time of checking that he had copied down two values as 7 and 
18 while the correct values were 13 and 17. Find the correct value of the 
arithmetic mean. . 

Ans. 20.2 : 

Q.7 The mean of the 10 numbers is 8. If an eleventh number is included in the 
data, the mean becomes 9. What is the value of the eleventh number? 

Ans. 19 

Q.8 Arithmetic mean of 98 items is 50. Two items 60 and 70 were left out at the 
time of calculations. Find the mean of 100 items? 

Ans. 50.3 

Q.9 From the following frequency distribution find out mean height of the students, 


[Hight Cnches) [oa | 65 [66 | o7 | 68 [ e0 | 70 [ 11 | 72 | 73] 
[No.ofstudents [4 | 9 [x2 fre [oo] fote lata 
Ans. 68 
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Q.10 Show that sum of the deviations of values from their mean is zero. 


[Cinsees | 10-19 | 20-20 | 20-29 | 40-09 | 50-00 | 
Prrecueney [6 fe | « | 0 | 


Ans. X= 35, f(X-X)=0 ~ 
Q.11 Calculate the average marks from the following data by 

(i) the direct method (ii) the short-cut method. 
INowofetudene | 6 | @ | | 26 | @ | a | 2 | 
Ans. (i) 30.14 (ii) 30.14 


Q.12 The following table gives the daily wages of a group of persons working in a 
factory. Apply the step deviation method to find the mean daily wages. 


Daily wages Daily wages 
62 


155 — 157 170-172 
158 - 160 
161-163 
164 — 166 
167 — 169 
Ans. 168.76 


Q.13A student obtained 40, 50, 60, 80 and 45 marks in the subjects of English, 
Urdu, Mathematics, Statistics and Physics respectively. Assigning weights of 5, 
2, 4, 3 and 1 respectively for the above mentioned subjects. Find weighted 
arithmetic mean per subject. 

Ans. 55 

Q.14 The following table indicates the increase in cost of living over July 1992, for a 
working class family as at Ist July 1996, and the weights assigned to various 


groups, 


173 — 175 
176 — 178 
179-181 


Groups Percentage increase 


Food 
Rent 
Clothing 
Fuel and lighting 
Other items 


Find out the weighted average of the increase in cost of living. 
Ans. 47.02 
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Q.15 An examination was held to decide the award of a scholarship. The weights of 
various subjects were different. The marks obtained by 3 candidates (out of 100 


in each subject) are given below: 


4° 63 

3 65 70 

2 58 63 
52 


If the candidate getting the highest marks is to be awarded the scholarship, 
who should get it? 

Ans. Student C. 

Q.16 The arithmetic mean of 100 items is 5 and the arithmetic mean of 150 items is 
4.8, Find the combined arithmetic mean. 

Ans. 4.88 

Q.17 The teachers of statistics reported mean examination marks of 37.5, 41 and 42 
in their classes which consisted of 32, 25 and 17 students respectively. 
Determine the mean marks for all the classes taken together. 

Ans. 39.72. 


Q.18A distribution consists of four components with frequencies 15, 12, 16, 21, 
having their means 16.5, 20.3, 21.6 and 26.2. Find the mean of the combined 


distribution. 
Ans, 21.67 
Q.19 The mean heights and the number of students in three sections of a statistics 


class are a below: 
a 
37 43 


a 
Mean height a ren 


Find the combined mean of the whole class. 

Ans. 60,26 

Q.20The mean height of 25 students in section A of a class is 61" and the mean 
height of 35 students in section B of the same class is 58". Find the overall 
mean height of the 60 students. 

Ans. 59,25 

Q.21 The mean wage of 100 workers working in a factory running two shifts of 70 
and 30 workers respectively is Rs. 84. The mean wage of 70 workers working in 
morning shift is Rs. 90. Find the mean wage of 30 workers working in the 
evening shift. 

Ans. Rs. 70 


Subject 
Statistics 
Mathematics 


Economics 


Punjabi 
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Q.22The mean of marks in mathematics of 100 students of a class was 72. The 
mean of marks of boys was 75 while their number was 70. Find out the mean 


marks of girls in the class. 

Ans, 65 R 

Q.28 The average wage of 4 men is Rs. 17 per hour. What is the average wage of 
further 6 men if the average wage of all 10 men is Rs. 20? 


Ans. 22 
Q.24The mean marks obtained by 300 students are 56. The mean of the top 100 


students of them was found to be 80 and the mean of the bottom 100 of them 
was found to be 22. What is the mean of the remaining 100 students. 


Ans, 66 


Q.25 From the following data find the missing frequency when mean is 15.38: 


Mid value | 10 
No. of students 3 
Ans. 12 
Q.26 The arithmetic mean of the following series is 30.5 marks. Find the missing 
figure. 
Marks 10 20 2 40 50 
Frequency 8 10 20 15 7 
Ans, 30 


Q.27 Find out the median of the following values: 
(i) 8, 10, 12, 18, 20, 27, 32 
Gi) 5, 4, 8, 3, 7, 2,9 
(iii) 18.3, 20.6, 19.3, 22.4, 20.2, 18.8, 19.7, 20.0 
(iv) 22, 26, 14, 30, 18, 11, 35, 41, 12, 32 

Ans. (i) 18 (ii) 5 (iii) 19.85 (iv) 24 


Q.28 Iind the value of the median. 
80 - 100 |100- 120]120-140]140- 160]160- 180 


Hourly wages ( Rs.) 60-80 


No. of employees 


Ans. 109.67 
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Q.29 Compute median, 6th decile and 74th percentile for the following data: 


Diameter ( inches ) Diamefer (inches) 


0.7312 — 0.7313 10 0.7320 — 0.7321 
0.7314 — 0.7315 15 0.7322 — 0.7323 


0.7316 — 0.7317 20 0.7324 — 0.7325 
0.7318 — 0.73819 25 


Ans. Median = 0,73183, De = 0.73192, P74 = 0.73203 


Q.30 The deviations from X = 22.5 of different values of X are: — 12, — 8.5, 3.0, 0, 2.5, 
6.6, 9.2, 1.6, 0.5 and 0.4. Find the lower and the upper quartiles of variable X. 


Ans. Q: = 20.375, Qs = 26.4 


Q.31 Find median, upper and lower quartiles and 65th percentile for the distribution 
of heights of 300 boys. 


[Hight in inches [60 [60 [67 [66 [oo [oo [oo [mi [so [@ [a [a] 
Pro. otteve [a [3 [7 [e [2s foo [a5 [so [00 [oe [a0 fo [a | 


Ans. Median = 52.08, Qi = 50.34, Qs = 53.48, Pos = 52.94 


Q.32The daily wages for a group of 200 persons have a obtained from a 
frequency distribution of a continuous variable X, after making the 
substitution u = (X — 130) /20. 


a 2 


Calculate the median, Blawgs and upper quartiles, 3rd decile and 98th 
percentile. 


Ans. Median = 130.75, Qi = 117.2, Qs = 144.33, Ds = 120.75, Pos = 159.67 
Q.33 Calculate the median, 3rd decile and 20th percentile from the following data: 


[Gontetsie [28 | ts | ws | ite | ae | 
Poo fe gee, fa, | 
Ans. Median = 15,Ds= 11, P20 =8.61 
Q.34 Find the mode of each of the following sets of values. 
(i) 27, 29, 27, 25, 24, 27, 25, 29 (ii) 412, 426, 435, 412, 428, 435, 427 
(iii) 2, 2, 5, 7, 9, 9, 9, 10, 10, 11, 18, 12 (iv) 101, 106, 99, 108, 76, 87, 102, 93 
(v) 9, 12, 5, 4, 3, 6, 11, 7, 5, 2, 11, 9, 13, 7, 6, 8. 
Ans. (i) 27 (i) 412 and 435 ii) 9 (iv) no mode (v) no mode 
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Q.35 The following table, shows the number of men in various age groups with some 
form of paid employment in a village. The age recorded for each man is the 


number of completed years lived. 
Age(years) 21-30 | 31-40 | 41-50 71-80 
12 4 26 35 23 5 1 


Calculate the mode. 
Ans. 44.79 
Q.36 Compute mode of the data: 


Hourly wages 
No. of employees 


Ans, 8.96 


Q.37 Following is the distribution of the size of certain farms selected at random 
from a district, calculate the mode of distribution: 


fio [2 [0] 
he [a] 


Central size of the farm in acres 


Number of farms 


Ans. 45.71 
Q.38 The following table shows the distributions of the maximum loads in short tons 
supported by certain cables produced by a company: 


9,3 -9.7 

9.8 — 10.2 5 11.8- 12.2 
10.3 - 10.7 12.3 — 12.7 
10.8 - 11.2 12.8- 13.2 


Determine mean, median and mode. 
Ans. Mean = 11.092, Median = 11.073, Mode = 11,063 


Q.39 Find the value of mode by using the empirical relationship between averages 
for the following data: 


No. of students 


Ans. Mode = 33.5 
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Q.40(i) In a moderately asymmetrical distribution the mode and mean are 82.1 
and 35.4 respectively. Calculate the median. 


(ii) In a moderately asymmetrical series, the value of arithmetic mean and 
median is 20 and 13.67 respectively. Find out the value of mode. 


(iii) If the mode and mean.of a moderately asymmetrical series are respectively 
. 16cm. and 20.2 em., compute the most probable value of median. 


Ans. (i) 34,3 (ii) 16.01 (iii) 18.8 em. 

Q.41 Form a frequency distribution taking the variable as the number of letters in a 
word in the stanza given below and calculate the mean, the median and mode 
of the distribution. 

"Heights by great men reached and kept, 

Were not attained by sudden flight; 

But they while their companions slept, 

Were boiling upwards in the night". 
Ans. Mean = 4.8, Median = 5, Mode is illdefined 
Q.42 The expenditure of 100 families is given below : 


No. of families 


Mode for the distribution is 24. Calculate the missing frequencies. 
Ans. 23 and 21 
Q.43 Calculate the geometric mean of the following price relatives: 


ee Pulses 


Ans. 159.77 

Q.44 Calculate geometric mean for the following numbers: 
(i) 8, 40, 175, 1209, 2000 
(ii) 6.5, 169.0, 11.0, 112.5, 14.2, 75.0, 35.5, 215.0 
iii) 0.35, 3, 30, 300, 3000, 0.3, 0.03, 0.0362, 0.00482, 58642 

ns. (i) 168.39 (ii) 42.70 (iii) 4.38 

Q.45 The reciprocals of 8 different values of X are 0.0667, 0.1111, 0.0833, 0.0556, 
0.0500, 0.0357, 0.0278, 0.0222. Calculate (i) arithmetic mean (ii) geometric 
mean of X values. 

Ans, (i) 22.875 (ii) 20.082 

Q.46 The deviations of 10 different values from X = 22 are: 0, 2, — 3, — 4, 6, 5, 8, — 1, 
0, 3. Calculate the geometric mean. 

Ans. 23.31 
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Q.47 For the following data find the geometric mean. 
Weight in gm. | 60-80 | 80-100 | 100-120] 120-140] 140-160 160-180] 180-200] 


Ans. 105.61 
Q.48 Find geometric mean ft the following frequency distribution: 


Ans. 3.41 

Q.49 Find two numbers whose mean is 9.0 and geometric mean is 7.2. 

Ans, 14.4 and 3,6 

Q.50 The mean and geometric mean of three numbers are 7 and 4 ruspectively, Find 
all the three numbers if mean of first two numbers is 10. 

Ans, 16, 4 and 1 

Q.51The mean and geometric mean of three numbers are 34 and 18 respectively. 
Find all the three numbers when the geometric mean of first two numbers is 9. 

Ans. 27, 3 and 72 

Q.52A man gets a rise of 15 % in his salary after the first year of his service and a 


further rise of 20 % and 25 % in his salary at the end of his second and third 
year of service respectively. Find his average annual percentage increase. 

Ans. 19.92 %. 

Q.53 All children in a number of families were together at a party. Each child was 
asked to state the number of children in his (or her) family. Two children said 
one child, four children said two children and twelve children said three. 

(i) How many families were at the party? 
(ii) Calculate the arithmetic mean of the number of children per family. 

Ans, (i) 8 (ii) 2.25 ( 2 children approximately ) 

Q.54 The population of a country increased by 20 % in the first decade, 30 % in the 
second decade and 45 % in the third decade. What is the average rate of 
increase per decade in the population? 

Ans. 31.28 % 

Q.55If the industrial production increased 3 percent in first year, 4 percent in 
second year and 5 percent in third year, what is the average annual increase 
for the three years. 

Ans. 4% 

Q.56 The arithmetic mean of two items is 12.5 and geometric mean is 10. Find two 
items. 

Ans, 20 and 5 
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Q.57 The weekly incomes of 8 families in rupees in a certain locality are given below. 
Calculate the harmonic mean. 


[= [e[>] =]? ] 
Income( Rs.) | 2700 | 2000 | 5000 | 7500 | 1800 | 2500 | 4800 


Ans. 3071.02 
Q.58 Calculate the harmonic mean from the following data : 


4200 


Frequency 1 


Ans. 3.38 


Q.60 If the arithmetic mean of two numbers is 20 and their geometric mean is 16, 
find the harmonic mean. 


Ans. 12.8 
Q.61 Consider the following data: 


Compute the harmonic mean. 
Ans: 63.05 


Chapter 


4 
_MEASURES OF DISPERSION 


4.1. INTRODUCTION 


A modern student of statistics is mainly interested in the study of variability 
and uncertainty. In this chapter we shall discuss variability and its measures and 
uncertainty will be discussed in chapter seven. We live in a changing world. 
Changes are taking place in every sphere of life. A man of statistics doe. not show 
much of interest in those things which are constant. The total area of the earth may 
not be very important to a research minded person but the area under different 
crops, area covered by forests, area covered by residential and commercial buildings 
are figures of great importance because these igures keep on changing from time to 
time and from place to place. Very large number of experts are engaged in the study 
of changing phenomenon. Experts working in different countries of the world keep a 
watch on forces which are responsible for bringing changes in the fields of human 
interest. The agricultural, industrial and mineral production and their 
transportation from one part to the other parts of the world are the matters of great 
interest to the economists, statisticians and other experts. The change. in human 
population, the changes in standard of living, changes in literacy rate and the 
changes in prices attract the experts to make detailed studies about them and then 
correlate these changes with the human life. Thus variability or variation is 
something connected with human life and the study is very important for mankind, 
4.2, DISPERSION 

The word dispersion has a technical meaning in statistics, The average 
measures the centre of the data. It is one aspect of the observations. Another feature 
of the observations is as to how the observations are spread about the centre. The 
observations may be close to the centre or they may be spread away from the centre. 
If the observations are close to the centre (usually the arithmetic mean or median), 
we say that dispersion, scatter or variation is small. If the observations are spread 
away from the centre, we say dispersion is large. Suppose we have three groups of 
students who have obtained the following marks in a test. The arithmetic means of 
the three groups are also given below: 


Group A: 46, 48, 50, 52,54, X,=60 
Group B; 30, 40, 50, 60,70, X,=50 


Group C: 40, 50, 60, 70, 80, X,=60 
145 
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In group A and B arithmetic means are equal ie., X, = X, = 50. But in group A 
the observations are concentrated on the centre. All the students of group A have 
almost the same level of performance. We say that there is consistence in the 
observations in group A. In group B, the mean is 50 but the observations are not 
close to the centre. One observation is as small as 30 and one observation is as large 
as 70. Thus there is greater dispersion in group B. In group C the mean is 60 but the 
spread of the observations with respect to the centre 60 is the same as the spread of 
the observations in group B with respect to their own centre which is 50. Thus in 
group B and C the means are different but their dispersion is the same. In group A 
and C the means are different and their dispersions are also different. Dispersion is 
an important feature of the observations and it is measured with the help of the 
measures of dispersion, scatter or variation. The word variability is also used for this 
idea of dispersion. 

The study of dispersion is very important in statistical data, If in a certain 
factory there is consistence in the wages of workers, the workers will be satisfied. 
But if some workers have high wages and some have low wages, there will be unrest 
among the low paid workers and they might go on strikes and arrange 
demonstrations. If in a certain country some people are very poor and some are very 
rich, we say there is economic disparity. It means that dispersion is large. The idea 
of dispersion is important in the study of wages of workers, prices of commodities, 
standard of living of different people, distribution of wealth, distribution of land 
among farmers and various other fields of life. Some brief definitions of dispersion 
are: 

(1) The degree to which numerical data tend to spread about an average value is 
called the dispersion or variation of the data. 

(2) Dispersion or variation may be defined as a statistics signifying the extent of 
the scatter of items around a measure of central tendency. 

(3) Dispersion or variation is the measurement of the scatter of the size of the 
items of a series about the average. 

4.3. MEASURES OF DISPERSION 

For the study of dispersion, we necd some measures which show whether the 
dispersion is small or large. There are two types of measures of dispersion which are: 

(a) Absolute measures of dispersion. (b) Relative measures of dispersion. 
4.3.1. ABSOLUTE MEASURES OF DISPERSION 

These measures give us an idea about the amount of dispersion in a set of 
observations, They give the answers in the same units as the units of the original 
observations. When the observations are in kilograms, the absolute measure is also 
in kilograms. If we have two sets of observations, we cannot always use the absolute 
measures to compare their dispersion. We shall explain later as to when the 
absolute measures can be used for comparison of dispersion in two or more than two 
sets of data. The absolute measures which are commonly used are: 

G) The range (ii) The quartile deviation 

(iii) The mean deviation (iv) The standard deviation or variance. 
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4.3.2, RELATIVE MEASURES OF DISPERSION 

These measures are calculated for the comparison of dispersion in two or more 
than two sets of observations. These measures are free of the units in which the 
original data is measured. If the original data is in rupees or kilometers, we do not 
use these units with relative measures of dispersion. These measures are a sort of 
ratio and are called coefficients. Each absolute measure of dispersion can be 
converted into its relative measure. Thus the relative measures of dispersion are : 
(i) Coefficient of Range or Range Coefficient of Dispersion. 
(ii) Coefficient of Quartile Deviation or Quartile Coefficient of Dispersion. 
(iii) Coefficient of Mean Deviation or Mean Coefficient of Dispersion. 
(iv) Coefficient of Standard Deviation or Standard Coefficient of Dispersion. 
(v) Coefficient of Variation (a special case of Standard Coefficient of Dispersion). 
4.4. THE RANGE 

Range is defined as the difference between the maximum and the minimum 
observation of the given data, If Xm denotes the maximum observation, Xo denotes 
the minimum observation then the range is defined as 

Range = Xm—Xo 

In case of grouped data, the range is the difference between the upper 
boundary of the highest class and the lower boundary of the lowest class. It is also 
calculated by using the difference between the mid points of the highest class and 
the lowest class. It is the simplest measure of dispersion. It gives a general idea 
about the total spread of the observations. It does not enjoy any prominent place in 
statistical theory. But it has its application and utility in quality control methods 
which are used to maintain the quality of the products produced in factories. The 
quality of products is to be kept within certain range of values. 


The range is based on the two extreme observations. It gives no weight to the 
central values of the data. It is a poor measure of dispersion and does not give a good 
picture of the overall spread of the observations with respect to the centre of the 
observations. Let us consider three groups of data which have the same range : 

Group A: 30, 40, 40, 40, 40, 40, 50 

Group B: 30, 30, 30, 40, 50, 50, 50 

Group C: 30, 35, 40, 40, 40, 45, 50 

In all the three groups the range is 50 — 30 = 20. In group A there is 
concentration of observations in the centre. In group B the observations are friendly 
with the extreme corners and in group C the observations are almost equally 
distributed in the interval from 30 to 50. The range fails to explain these differences 
in the three groups of data. This defect in range cannot be removed even if we 
calculate the coefficient of range which is a relative measure of dispersion. If we 
calculate the range of a sample, we cannot draw any inference about the range of the 
population. 
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4.4.1, COEFFICIENT OF RANGE 

It is a relative measure of dispersion and is based on the value of range. It is 
also called range coefficient of dispersion. It is defined as: 

Coefficient of Range = aoe 

The range Xm — Xo is standardised by the total Xm + Xo. 

Let us take two sets of observations. Set A contains marks of five students in 
Mathematics out of 25 marks and group B contains marks of the same students in 
English out of 100 marks 

Set A: 10, 15, 18, 20, 20 

Set B: 30, 36, 40, 45, 50 

The values of range and coefficient of range are calculated as: 


Coefficient of Range 


Set A (Mathematics ) 20-10 = 


Set B ( English ) 50-30 = 20 


In set A the range is 10 and in set B the range is 20. Apparently it seems as if 
there is greater dispersion in set B. But this is not true. The range of 20 in set B is 
for large observations and the range of 10 in set A is for small observations. Thus 20 
and 10 cannot be compared directly. Their base is not the same. Marks in 
Mathematics are out of 25 and marks in English are out of 100. Thus, it makes no 
sense to compare 10 with 20. When we convert these two values into coefficient of 
range, we see that coefficient of range for set A is greater than that of set B. Thus 
there is greater dispersion or variation in set A. The marks of students in English 
are more stable than their marks in Mathematics. 

Example 4.1. 

Following are the wages of 8 workers of a factory. Find the range and the 
coefficient of range. 

Wages (in Rs.): 1400, 1450, 1520, 1380, 1485, 1495, 1575, 1440. 

Solution: 
Here, Largest value = Xn = 1575 and Smallest value = Xo = 1380 
Range= Xm-Xo = 1575-1380 = 195 
Xm— Xo 1575-1380 _ 195° 


Coefficient of Range = Xmn+Xo ~ 1575+1380 7 2955 = 0.066 


Example 4.2. 
Find the range of the weights of the students of 


| Weights (kilograms) | 60-62 | 63-65 | 


Ser 
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Class boundaries Number of students 
5 


59.5 - 62.5 61 
~1« 62,5 - 65.5 64 18 
65.5 — 68.5 67 
68.5 - 71.5 70 
71.6 - 74.5 73 


Solution: 


Weights ( kilograms ) 


Method 1: 
Xn = Upper class boundary of the highest class = 74.5 
Xo = Lower class boundary of thetewest « ‘ss = 59.5 
Range= Xn-Xo = 74.5-59.5 = 15 kilograms. 
Method 2: 
Xn = Mid value of the highest class = 73 
Xo = Mid value of the lowest class = 61 


Range= Xn—Xp = 73-61 = 12 kilograms. 
4.5. QUARTILE DEVIATION 
It is based on the lower quartile Q, and the upper quartile Q,. The difference 
Q; — Q, is called the inter quartile range. The difference Q; - Q, divided by 2 is 
called semi-inter-quartile range or the quartile deviation. Thus 


Quartile Deviation ( Q.D.) = B= a 


The quartile deviation is a slightly better measure of absolute dispersion than 
the range, But it ignores the observations on the tails. If we take different samples 
from a population and calculate their quartile deviations, their values are quite 
likely to be sufficiently different, This is called sampling fluctuation, It is not a 


in the population, 
4.5.1, COEFFICIENT OF QUARTILE DEVIATION 


A relative measure of dispersion based on the quartile deviation is called the 
coe/ficient of quartile deviation. It is defined as 


Qo=Qi 
2 Q:-Q 
Coefficient of Quartile Deviation = Gta BEG Ore Q; 
2 


It is a pure number free of any units of measurement. It can be used for 
comparing the dispersion in two or more than two sets of data. 
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Example 4.3 
The wheat production (in kgs.) per acre is given as: 
1120, 1240, 1320, 1040, 1080, 1200, 1440, 1360, 1680, 1730. 
Find the quartile deviation and coefficient of quartile deviation. 
Solution: 
After arranging the observations in ascending order, we get 


1040, 1080, 1120, 1200, 1240, 1320, 1360, 1440, 1680, 1730 


Q: = Value of (ae 1) thitem = Value of (22 1) th item 
= Value of (2.75) th item 
= 2nd item + 0.75 ( 8rd item — 2nd item ) 
= 1080 + 0.75( 1120 - 1080) = 1080+ 80= 1110 
+ 
= Value of 222th item = Value of 49 th item 
= Value of (8.25) th item 
= 8th item + 0.25 ( 9th item — 8th item ) 
= 1440 + 0.25 (1680-1440) = 1440+60 = 1500 
Quartile Deviation = Qs— Qt z a 4500-1110 z — 390 = 195 


; _ Qs=Qi — 1600=1110'_ 390: _ 
Coefficient of Q.D. = Q:+Qi = 150041110 ~ 2610 ~ 0.15 


Example 4.4, 


Calculate the quartile deviation and coefficient of quartile deviation from the 
data given below. 


9.25 — 9.75 11.25 - 11.75 


9.75 — 10.25 11.75 — 12.25 


10.25 — 10.75 12.25 — 12.75 


10.75 — 11.25 12.75 — 13.25 
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Solution: 
The necessary calculations are given below: 


Maximum Load (Short-tons ) | No. of cables (f) Cumulative frequencies (c.f) 


9.25 — 9.75 « 2 
9.75 — 10,25 2+5 =7 
10,25 — 10.75 7+12 = 19 
10.75 — 11.25 19+17 = 36 
11.25 — 11.75 86+14 = 50 
11.75 — 12.25 50+6 = 56 
12.25 — 12.75 56+3 = 59 
12.75 — 13,25 


Q, = Value of(2) thitem= Value of( 2) thitem = 15th item 
Q, lies in the class 10.25 — 10.75. 


See 


where / = 10.25, h = 0.5, f = 12, 


n 
ge ee 1S and c=7 


Thus, Q, = 10.25 + 23 (15 - 7) = 10.26+0.33 = 10.58 


Q; = Value of( BY en item = Value of 3160) th item = 45th item 
Q; lies in the class 11,25 - 11.75. 


ec 


where | = 11.25, h= 0.5, f = 14, 


co 


in 
4G rine and c = 36 


Thus, Q, = 11.25 + rd (45-86) = 11,25+0,382 = 11,57 


Quartile Deviation (Q.D.) = at 1 os ALST=10.58 Cy 2.98 = 0.495 
: _ Qs-Qi _ 1157-1058 _ 0.99 _ 
Coefficient of Q.D. a oe@u 115771068 = 2215 0.045 
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4.6. THE MEAN DEVIATION 

The mean deviation or the average deviation is defined as the mean of the 
absolute deviations of observations from some suitable average which may be the 
arithmetic mean, the median or the mode. The difference ( X - average ) is called 
deviation and when we ignore the negative sign, this deviation is written as 
|X — average| and is read as modulus deviation. The mean of these modulus or 
absolute deviations is called the mean deviation or the mean absolute deviation, 


Thus for a sample data in which the suitable average is the X, the mean deviation 
( M.D.) is given by the relation: 


E|X-X 
mp = SSE 
For a frequency distribution, the mean deviation is given by 


mp, = 2lx=x 


=f 
when the mean deviation is calculated about the median, the formula becomes 


M.D. ( about median ) = 2 es ea 


=f 


The mean deviation about the mode is 


X— Mod. 
M.D. ( about mode) = Ree ea 


=f 


For a population data the mean deviation about the population mean pis 


mp. = zZlX=x 


=f 


The mean deviation is a better measure of absolute dispersion than the range 
and the quartile deviation. 
A drawback in the mean deviation is that we use the absolute deviations 


| X — average| which does not seem logical. The reason for this is that £( X — X ) is 


always equal to zero. Even if we use median or mode in place of X, even then 
£( X — median ) or £( X — mode ) may be zero or approximately zero with the result 
that the mean deviation would always be either zero or close to zero. Thus the very 
definition of the mean deviation is possible only on the absolute deviations. 

The mean deviation is based on all the observations, a property which is not 
possessed by the range and the quartile deviation, The formula of the mean 
deviation gives a mathematical impression that it is a better way of measuring the 
variation in the data. Any suitable average among the mean, median or mode can be 
used in its calculation but the value of the mean deviation is minimum if the 
deviations are taken from the median. A serious drawback of the mean deviation is 
that it cannot be used in statistical infer’ 
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4.6.1. COEFFICIENT OF THE MEAN DEVIATION 

A relative measure of dispersion based on the mean deviation is called the 
coefficient of the mean deviation or the mean coefficient of dispersion. It is defined as 
the ratio of the mean deviation to the average used in the calculation of the mean 
deviation. Thus 


Mean Deviation from Mean 

Mean Veviation from Mean 
Mean 

Mean Deviation from Median 

Sean —Swaation krom Median 
Median 


2via til cl 
Coefficient of M.D.( about mode) = Mean Devistion from Mode 


u 


Coefficient of M.D. ( about mean ) 


Coefficient of M.D.( about median ) 


Example 4.5, 

Calculate the mean deviation from (i) arithmetic mean (ii) median (iii) mode in 
respect of the marks obtained by nine students given below and show that the mean 
deviation from median is minimum. 

Marks ( out of 25 ): 7, 4, 10, 9, 15, 12,.7,.9, 7. 

Solution: 
After arranging the observations in ascending order, we get. 
Marks: 4, 7, 7, 7, 9, 9, 10, 12, 15. 
=X_ 80 
Mean = ng = 888 
# Oct 
Value or (® 2 ") thitem = Value ot( 2 ) th item 
= Value of (5)thitem = 9 
Mode = 17(since 7 is repeated maximum number of times ) 
ee 


Median 


" 


Marks (X) |X-Mean | |X — Median | A |X - Mode | ‘| 
4 4.89 5 3 
7 1,89 2 0 
7 1.89 2 0 
7 1.89 2 0 
9 0,11 0 2. 
9 0.11 0 2 
10 1.11 1 3 
12 3.11 3 5 
fhouaih BB) 6.11 6 8 
Total “S| wonatanionge| —— aa Beis al 
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MLD; feateam eee X= Mean = 2H L235 
= | X—Median 


M.D. from median 
= | X—Mode | 
M.D. from mode z | X—Mode 


From the above calculations, it is clear that the mean deviation from the 
median has the least value. 
Example 4.6 


Compute the mean deviation from median and its coefficient from the following 
data: 


[ Daily wages (Rs}) | 200 - 250] 250 - 300 | 300 - 350 350 - 400 | 400 - 450 
| No.ofpereons [10 [20 | 40 [ 2 | 10 | 


Solution: 
The necessary calculations are = below: 


= 2.56 


9 
21 
ret = 2.33 
23 
9 


200 — 250 
250 — 300 


300 — 350 
350 — 400 
400 — 450 


Median = Value *) th item = Value ot (202) thitem = 50th item 


Median lies in the class 300 — 350 
Median 1+8 (3-<) = 300+ & (50-30) =300+25 = 325 
xf |X-median| _ 4000 


M.D. from median = if = 00 = 40 
7 . _ M.D. from median _ 40 _ 
Coefficient of M.D. from median = “~"“Vyegian = 325 = 0.123 


4.7. THE VARIANCE 

The variance of the.observations is defined as the mean of the squares of 
deviations of observations taken from the,mean of the observations. For a sample 
data the variance is denoted by S? and the population variance is denoted by o” 
(sigma square). By definition, th~ sample variance S? has the formula 
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fo 4 x(x- x )?, where X is the sample mean and n is the number of observations 


1 
in the sample. The population variance o” is defined as o? = N X( X —p)? where p is 


the mean of the population and N is the number of observations in the data. It may 
be remembered that the population variance co’ is usually not calculated. The 
sample variance S? is calculated and if needed, this S” is used to make inference 
about the population variance. 


The term £( X — X )? is positive, therefore S? is always positive. If the original 
observations are in cms, the value of the variance will be (cms)”. Thus the unit of S? 
is the square of the units of the original measurement. For a frequency distribution 
the sample variance S’ and population variance o” are defined as: 


Ss? = 4 Ef X- XK)? and o?= # BK X-1)* 


4.8. STANDARD DEVIATION 


The standard deviation is defined as the positive square root of the mean of the 
squares of the deviations of observations from their mean. For the sample data the 
standard deviation is denoted by S and is defined as: 


af =. 
S = Mare re) . 


For a population data the standard deviation is denoted by o(sigma) and is 
defined as: 4 


o = \/eE(X-n)? 


For a frequency distribution the formulas become 


1 ~ 
pa \ | 5g 1KX-X)? and o = Hau x —w)? 


Thus the standard deviation is the positive square root of the variance, The 
standard deviation is in the same units as the units of the original observations. If 
the original observations are in gms, the value of the standard deviation will also be 
in gms. 

The standard deviation plays a dominating role for the study of variation in the 
data. It is a very widely used measure of dispersion. It stands like a tower among 
other measures of dispersion. As far as the important statistical tools are concerned, 


the first important tool4s the mean X and the second important tool is the standard 
deviation §. It is based on all the observations and is subject to mathematical 
treatment, It is of great importance for the analysis of data and for the various 
statistical inferences. 
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4.8.1. CHANGE OF ORIGIN 


The standard deviation has a strange property that its value does not change if 
some constant is added to all the observations or séme constant is subtracted from 
all the observations. We can say that the standard deviation and the variance are 
independent of change of origin. The standard deviation of 1, 2, 3, 4, 5 will be equal 
to the standard deviation of 101, 102, 103, 104 and 105. The reason is that the 
difference of 1, 2, 3, 4, 5 from their mean which is 3 is the same as the difference of 
101, 102, 103, 104 and 105 from their own mean which is 103. 


If Y = X +a where 'a’ is an arbitrary constant then Y = X+a and 


7 ¥y yz 4 2 xX 2 
Avie = \ [2 Xta=X=al’ _~ [5 X-X)* 
SD.Y) = ae a z Keak is x AX =SD(X) = & 


=a 
Thus S.D(X+a) = S.D.(X) and Var(X+a) = Var(X) 
Similarly, S.D.(X-a) = S.D.(X) and Var(X-a) = Var(X) 
4.8.2. CHANGE OF SCALE 


When all the observations are multiplied by a certain constant or divided by some 
constant, the process is called change of scale. Let us multiply X by a constant 'b'. 


“4 fe ee 2 
I Y=bX then ¥ = bK and sp y) = \/2%=¥ 


Putting the values of Y and y, we get 


EDX — bX ? 2(b!X? + b¢X? ~ 2b°XX 
copa (SRE fea 

2 ¥( X? + X?- 2xX =X) 

b? 3(X +x 2XX x 1 2B spo 


Therefore S.D.(Y) = S.D.(bX) = bS.D(X) and Var(bX) = b? Var (X) 
4,8.3. CHANGE OF ORIGIN AND SCALE 
Both the operations of change of origin and scale can be applied at the same 


time. Thus if we take X — a, it is change of origin and if we take x= a it is called 


change of origin and scale. 
Let Xaa be denoted by u where u is called the coded variable. 


Thus u = Xa and X = at+huand X = a+hu 
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eee 
By definition 


a, 2 - 
8.D.(X) = Ss 2 Putting the values of X and X, we get 


n 


ear py 
$.D.(X) =-\ tatu = yn fee 


Similarly Var(X) = h? Var (u) 
4.8.4, VARIANCE OF A CONSTANT 


Let all the n observations be equal to a constant, say C, Then G = ( 


2 0-6/2 —Cy 
By definition $.D.() = \|#2=S2" _ . fHS= Cy _ 0 Thus Var (C)= 0 


4.8.5, OTHER FORMULAS FOR CALCULATION OF S AND S? 


= hS.D.(u) 


be ee ah a | =x)? 
By definition S = NY ( X= 3) orS= BEC o SE (for frequency distribution) 


These formulas should be used when X has some exact value like X = fe =4 


ae = 20 
or X= Spas 4.5.1f X= ‘ee 3.3333, which is non-terminating, there will be some 
error and also some inconvenience in the calculation of §. The formula for § can be 


written in some other forms in which we do not have to take the deviations from x 
Some other formulas for the calculation of § are obtained as: 


J gs = \/RK=KY _ 4 /ELX?# Xt oxX] 
@ i n rf n 
2 X2 _ ox SK? 
z; =X +ak 2X EX _ =X + X98. X 
- \fE-2 - Ey 
n n n 
7 = . /2X2_(2X¥ » . EX? (EX) EX'S, 
Thus § = Fi (2) and S* = a = = Te x 


Thus the variance is the mean of the squares of observations minus the square 
of the mean, The above expression for ZX? can be written as 
=X? = n(S?+ X?) 
For frequency distribution 


a zfX? (2a tock He (SS 
ea a5 eal ba) AAC ena a Ga 


It is very convenient to use this formula when the X values are small. 
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(ii) Ifwetake D = X-—a, then D = X-a 
We have X = Dta and X=a+D 
By definition * 
-xy 
S.D.(X) = S = \j2=XE 


D[D+a-a~D x(D-D 
Sx = = = 5 = 
2 2 2 (rp) 
Thus, Sx = Sp = 2D* _ (2D) and S?= S? = 2D? _ (=D) 
For frequency distribution the formulas would be 
i .: rfD? (xfby eS 2 _ (ED) 
Sx = S,)= Sf ( Sf ) and Sy=S)= Tf 7 


These formulas are based on the property of standard deviation that the 
standard deviation does not change by change of origin. Standard deviation of X is 
equal to the standard deviation of D where D =X + a. 


Gi) Ifwetake u = X=2 | then X = a+huand X=athi 
| x2 
By definition Sx = HAAKY . Putting the values of X and X in the formula, 


we get : 


2(a+hu-—a~hu)? L(u-u)? 
. S. = 7 =n a 
sake But _(Zu dis? = [Be _(22)'] 
ae n n besa a n n 


For frequency distribution, the formulas for S and S* would be 
a ee eee J] 
Se aN nae e) anne v/s Sf 
These formulas involving the coded variable 'u' are particularly suitable for 


grouped data. We convert the variable X into another coded variable u where 


X- 5 ‘ 
u= ae , 'a' is arbitrary constant preferably the value of X somewhere in the 


middle of the X values. The divisor h is the interval of the grouped data. If the 
interval is not constant, we can take any suitable value of h. 
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4.9, COEFFICIENT OF STANDARD DEVIATION 


The standard deviation is the absolute measure of dispersion. Its relative 
measure is called standard coefficient of dispersion or coefficient of standard 
deviation. It is defined as; 


Coefficient of Standard Deviation = 


x1 |a 


4.10. COEFFICIENT OF VARIATION 


The most important of all the relative measures of dispersion is the coefficient 
of variation. This word is variation not variance, There is no such thing as coefficient 
of variance. The coefficient of variation (C.V.) is defined as 

. ae S 

Coefficient of Variation(C.V.) = —— x 100 

xX 


Thus C.V. is the value of S when X is assumed equal to 100. It is a pure 
number and the unit of observations is not mentioned with its value. It is written in 
percentage form like 20 % or 25 %. When its value is 20 %, it means that when the 
mean of the observations is assumed equal to 100, their standard deviation will be 
20. The C.V. is used to compare the dispersion in different sets of data particularly 
the data which differ in their means or differ in the units of measurement. The 
wages of workers may be in rupees and the consumption of meat in their families 
may be in kgs. The standard deviation of wages in rupees cannot be compared with 
the standard deviation of amounts of meat in kgs. Both the standard deviations need 
to be converted into coefficient of variation for comparison. Suppose the value of C.V. 
for wages is 10 % and the value of C.V. for kgs of meat is 25 %. This means that the 
wages of workers are consistent. Their wages are close to the overall average of their 
wages. But the families consume meat in quite different quantities. Some families 
use very small quantities of meat and some others use large quantities of meat. We 
say that there is greater variation in their consumption of meat. The observations 
about the quantity of meat are more dispersed or more variant. 


4.11. PROPERTIES OF STANDARD DEVIATION AND VARIANCE 


(i) The standard deviation and the variance are positive quantities. The standard 
deviation is expressed in the same units as the units of observations and the 
variance is expressed in square of the units of the observations. 


(ii) The standard deviation and the variance are zero if all the observations have 
some constant value. If‘C’ is a constant, then 


S.D(C) = 0 and Var(C) = 0 
(iii) The standard deviation and the variance do not change by change of origin. 
Thus §.D.(X +A) = S.D.(X) and Var (X +A) = Var (X) 
Also §.D.(X-A) = S.D.(X) and Var(X — A) = Var (X) 
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(iv) The standard deviation and the variance are affected by change of scale. 
Thus §.D.(bX) = |b] S.D.(X) and Var (bX)= b? Var (X) 


xX xX 1 
and sp{¥) = fa S.D.(X) and Var(¥) = pe Var (X). 


(v) If we have two sets of data with n, and n, observations having means X, and Xe, 
with variances Si and s}, respectively, then the combined standard deviation, 
S, of all the n = n, + ng observations is given by 
2 2 
niS; + naS: ss 2 
Be Nf tee ag CX) 


= nitn (mi + nz)? 
and the combined variance 8? is given by 
2 2 
2  miS; +n2S_ ning 


8, = ni+nz (ni + nz)? 


teed 
(Xi-X2) 
(vi) IfX and Y are independent random variables then, 

Var(X+Y) = Var(X)+Var(Y) and Var(X-Y) = Var (X) + Var (Y) 


Note: Independence of random variables has been discussed in Chapter 8. Students 
should know that if X and Y are independent random variables, then 
E(XY) = E(X) . E(Y) 


Example 4.7 


A manufacturer of flashlight batteries took a sample of 5 batteries from a day's 
production and used them continuously until they were drained. The number of 
hours they were used until failure were 342, 426,317, 545, 630. Compute the 
variance and coefficient of variation. 


Solution: 
The necessary calculations are given below: 


12100 
676 
18225 
8649 
31684 


X( X-X)?=71334 


u X-R)=0 
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= EX_ 2260 _ 
X = ae ee = 452 
-~x) 
S? = x xs a = 1834 = 14266.8, S = 14266.8 = 119.4437 
CV, = 8, 100 = eloieeni 100 = 26.43 % 
s 452 
xX 
Example 4.8, 


(i) Calculate standard deviation and coefficient of variation in respect of the 
marks obtained by 10 students given as: 50, 55, 57, 49, 64, 61, 64, 59, 58, 56. 

(ii) How would your results be affected if it is decided to increase the marks of each 
of the above students by 5, 


Solution: 

(i) The necessary calculations are given below: 

4 50 | 55 57 | 49 | 54 | G1 | 64 | 59 | 58 56 | Total 

D=X-56 | -6 -1 1 -7/-2 5 8 3 a 0 3 

D? 36 il 1 49 4 25 64 9 | 4 [ 0 193 
me a+2 = 56+ & = 56+03 = 563 


(0) 
193 2 
= \f(2y = yissran 
V19.21 = 4.383 


s 4.6 
Coefficient of Variation (C.V.) = — x 100 = 4383 100 = 7.78% 


fl 


(ii) When each item is increased by 5, the value of the arithmetic mean would be 
increased by 5 because when a constant is added to all the items, the value of 
arithmetic mean is increased by the constant added. Thus the arithmetic mean 
would be 56.3 + 5 = 61.3, The value of the standard deviation would not be 
affected when every item is increased by 5. However, the value of the 
coefficient of variation would be affected because mean is increased by 5. 


Coefficient of Variation (C.V.) = & x 100 = rice x 100 = 7.15 % 
x ; 


Example 4.9, 


The arithmetic mean and standard deviation of a series of 20 items were 
calculated by a student as 20 cm. and 5 cm, respectively. But while calculating them 


an item 13 was misread as 30. Find the correct values of arithmetic mean and 
standard deviation. 
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Solution: 


eS or =IX= nX 


Incorrect X = 20, Incorrect EX = nX = 20(20) = 400 
Correct <X = Incorrect 5X — Wrong item + Correct item 
400-30+13= 383 

= Correct 2X _ 383 

Correct x = = = "90 


gt. HE _(2k)', 


= 19.15 


We have a = pip Ce or IX? = n[S? + (X)?] 


Incorrect ZX? = 20[ 25+ 400 ] = 20(425) = 8500 
Correct =X? = Incorrect 2X?- (Wrong item )? + ( Correct item )? 
= 8500-(30)?+ (13)? = SN 900+ 169= 7769 


2 
a, ae ake =X: ~ (Comest x) » eo f. (ey =21.73 
Correct S = 2173 = 466 


Hence, the correct values of mean and standard deviation are 19.15 and 4.66 
respectively. 
Example 4.10. 
(a) Find the mean, variance, standard deviation and coefficient of variation of 
the first n natural numbers 1, 2, 3, ..., n. 
(b) Computer the mean, variance and coefficient of variation of the set of 
numbers 218, 209, 212, 211, 215 by using 208 as assumed mean. 


(c) For a set of 10 numbers ©( X —X )? = 160 and YX? = 1160. Find coefficient 
of variation. 
(dq) The time (in minutes) that a random sample of 12 employees took to 
complete a task is: 
13, 49, 40, 28, 23, 15, 17, 21, 37, 66, 30, 45 
Compute the mean and standard deviation for this sample. Also compute 


the percentage of observations lying within limits X+S, 


Solution: 
(a) We have = 
SKS 14 nee 


2 
DX? = 124 224 92+ pn = BE DGnt 1) 


6 
x- tk a(t D2 _ nt 
Ba n 4 
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pa ER _(ZEY oat Gn +1)_[atne td) 
Pe may) 6n 2n 
gra (Btii(gntt) (ntty ati(met att) 


6 2 2 3 2 
gi = Btt ante dna} _(as ita) at gaa [tt 
eos 6 eee Ne 


Coefficient of variation (C.V) == 4x 100 


NVG@t=1y/12 | [a+ 1m = 1/2 (n+ 1)(n=1)/12 [—a=3 n-1 
= “(nt 12 (n¥1)(n+1)/4 ~ \3(n4+1) 
(b) The necessary calculations are given below: 
Here A = 208 


Sa arte 


=2Dt_ (2D 15 _ (28 e a 
gia 4) = oR UB) = 36-25 = 10 


Coefficient of variation (C.V) =-§-*100 = 29 x 100 = 1.55 


213 
() Here, n= 10, Y(X-X)? = 160 and EX?= 1160 
g: = RK X _ 160 “10 =16 s=i6 =4 


Rath ae 116 - (X)? or (X)?= 116 - 16 = 100 or X= 10 


ov =-$x100 =4 x 100 = 40% 


(d) Heres n= 12, YX = 384 and DX? = 15088 
=k = 384 = 32 


e ce Tees - 
5 ECE == = 15.28 


X4S = 82+ 15.28 = 16.72,47.28 
Number of observations lying within these limits = 8 
Thus, proportion of observations = 8/12 = 0.67 or 67% 
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Example 4,11. 
@) Coefficient of variation of two series are 58 % and 69 %, their standard 
deviations are 21.2 and 15.6 respectively. What are their arithmetic means? 


(ii) For a set of 100 observations, the sum of the deviations from 4 cm. is -11 cm. 
and the sum of the squares of these deviations is 257 cm. Find the coefficient of 
variation. 


Solution: 
G) For Series I: 


cy. = &x100 
x 


| Here, GV. = 58% and S=212 
21.2 


° Hence, 58 = *100 or 58X = 2120 or X = ==" = 36.55 


For Series II: 


Ovi S x 100 
x 


Here, CV = 69% and S=156 


15. Z = 
Hence, 69 = 38 100 or 69X = 1560 or X= 4580 = 22.61 
a x 
(i) Here, n = 100, A = 4, =D = -11, xD? = 257 
a =D _ C= sf 
X = Ati = 4+ S990. = 4-011 1= 3.89 
 [eb?_(zp)* _ , /257_(=117 ; 
Boe n (2) 5 Gas) = 16 
C.V. = fe x 100 = a x100 = 41.13% 
= 3.89 


Example 4,12. 


(i) After settlement the average daily wages in a factory had increased from 
Rs.120 to Rs,180 and the standard deviation had increased from Rs. 2.5 to 
Rs. 3.75. After settlement the wages have become higher and more uniform. 
Comment. 

(ii) Examine whether the following data for obtaining the variance are consistent 
or not. 


n=120, EX=-125, =X*=128. 
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(i) Before settlement the average daily wages were Rs.120 with standard 
deviation as Rs.2.5. So coefficient of variation is, 


8 2.5 
CV. = =x100 = 120 x 100= 2.083 % 


x 


After settlement average daily wages become Rs.180 and standard deviation is 
Rs.3.75. So coefficient of variation is, 


CV. = S x 100 = Cte i A 2.083 % 
= 180 


Thus it is incorrect to say that the wages have become more uniform. 
Variability remains the same as the coefficient of variation remains the same. 
Gi) We have, 


=X? =x\ 128 125)" 
at =! = =| 2 
S's n ( n 120 ( a4 aes 


Variance can never be negative, therefore the data is inconsistent. 

iii) Here,n = 20, EX?= 2000, § = GiiSib= 636 
a es ERS ONE Ks OE 

Sy Hei 


2000 - = = 3 
36 = os ~(X)? = 100-(X)}# or (X)?=100-36=64 or X=8 


Total amount recovered = =X = nX = 20(8) = Rs.160. 


Hence, each of the passengers shall have to pay Rs.8 as fine. 
ixample 4,13 


ii) increasing all scores by 10 % (iii) decreasing all scores by 10 %,. 
jolution: 
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(i) By adding 10 points to all scores, we get Y=X+10, then 
Var(Y) = Var(X+10) = Var(X) + Var(10) = 8+0=8 
(ii) Increasing all scores by 10 %, we get Y=1.1X, then 
Var(Y) = Var(1.1X) = 1.21Var(X) = 1.21(8) = 9.68 
(iii) Decreasing all scores by 10 %, we get Y=0.9 X, then 
Var (Y) = Var(0.9 X) = 0.81 Var(X) = 0.81(8) = 6.48 
Example 4.14 
An exercise physiologist measured the heart rates of 20 people who had been 
placed on a long distance running program. A frequency distribution for these rates 
is displayed below: 
Rate (X) 
| Frequency (f ) 
Compute the sample standard deviation and coefficient of variation of the heart 
rates. 


Solution: 
The necessary calculations are given below: 


67 


68 
69 


a Eex 1418 _ 
k= SF = gp = 709 
_, feex?_ (2EXY _ ., [100618 (1418 _ 
e = fE-(F a 20 ~ (AREY = 202 
cv.= © x100 = 22% x 100 = 2.85% 
= 70.9 
x 
Example 4.15. 


Calculate the standard deviation and coefficient of variation of the following 
distribution taking D = X — 32.5. 


Cage (yearn) [20-28 [ 25-00 | 00-95 | 85 —00 | 40-46 | «5-00 
Frc ofperoons [170 | wo | oo | « | o | | 
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Solution: 
The necessary calculations are given below: 


CE i MAE A 


X = A+ Hp = 32,5 + S100) _ 30.5 2.292 = 30.208 


480 
_ . /2fD? 2) ? _ . [32750 _(—1100)" 
8 450 4s0 7 = 7986 
8 S ) 
CV. = z x 100 = 30.208 * 1009 = 26.27 % 
Example 4.16 


Compute the standard deviation using step deviation method of a sample of 
ages of restaurant workers who are insured under a group health insurance policy. 


[ages [15-24 | 25-04 | 
[Freaueney | a2 | 7 | a | m | 2 


Solution: 
The necessary calculations are given below: 
A=39.5, h= 10 
_ X-39.5 
10 


fut (pfuy | _. [250 _ 
ify 150 


2 
4) x10 = 12.88 
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4.12. SHEPPARDS CORRECTIONS 

In grouped data the different observations are put into the same class. In the 
calculation of variance or standard deviation for grouped data, the frequency f is 
multiplied with X which is the mid-point of the respective class. Thus it is assumed 
that all the observations in a class are centered at X. But this is not true because the 
observations are spread out in the class. This assumption introduces some error in 
the calculation of S’ and S. The value of S? and S can be corrected to some extent by 
applying sheppards correction. Thus 


S*(corrected) = 


where h is the uniform class interval 


S(corrected) 


This correction is applied in grouped data which have almost equal tails in the 
start and at the end of the data. If data has a longer tail on any side, this correction 
is not applied. If size of the class interval ‘h' is not the same in all the classes, the 
correction is not applicable. 

4.12.1. CORRECTED COEFFICIENT OF VARIATION 

When the corrected standard deviation is used in the calculation of the 
coefficient of variation, we get what is called the corrected coefficient of variation. 
Thus 

Corrected coefficient of variation = S(corrected ) x 100 
- xX 
4.13. COMBINED VARIANCE 

Like combined mean, the combined variance or standard deviation can be 

calculated for different sets of data. Suppose we have two sets of data containing n, 


and n, observations with means x, and X, and variances S; and Sz. If Ke is the 


. ax ce “ 4 
combined mean and S, is the combined variance of n, + n, observations, then 
cottbined variance is given by 


mS; + noSe + ni( Xie) en Menke 


S. = mi+ny 
c a m| $2 Ge | + na|_Sit (XK, | 
It can also be writtenas S, = ni Fins 


mX1 ie n2X2 
ni+nz 
The combined standard deviation S, can be calculated by taking the square 


where x = 


2 : ; f 
root of S.. The formula for combined variance can be generalized as: 


2% se l= Mt 

Eni [ Si + (Xi -X. )?2 = _ rniX 

s = : eee =] here i =1,2,3,+-,kand X. = aut 
Eni =Inj 
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Example 4.17. 


@) 


(ii) 


For a group of 50 male workers, the mean and standard deviation of their 
hourly wages are Rs. 63 and Rs. 9 respectively. For a group of 40 female 
workers, these values are Rs.54 and Rs.G respectively. Find the mean and 
variance of the combined group of 90 workers. y 

A distribution consists of three components with frequencies 200, 250 and 300 
having means 25, 10 and 15 and standard deviations 3, 4 and 5 respectively. 
Find the coefficient of variation of the combined distribution, 


Solution: (i) Here, n, = 50, Ke = 63,8, = 9, s? = 81 


Gi) 


n, =" 40; XK) ='"54)'8, =" 6; 8? = 36 


Z, = wXitmX: _ 5068) +40(54) _ 5310 


Combined AM. = Xe mtn = 60440 SOUS pace 
: 2 nl si (%,- Eilts sh (%- X)'] 
Combined Variance = Ss. = 
nit nz 
_ 50 [81 + (63 ~59)2] + 40 36 + (54 ~ 59)? 
Bi 50+ 40 
= 4850+ 2440 _ 7290 _ 
7 90 apeO0) ace es 
Here,n,= 200, X,= 25,8, = 3, SPs'%9 
n,= 250, X,= 10,8, = 4, $= 16 
n,= 300, X,= 15,8, = 6, Sh=i26 
Combined A.M, = BXi+meXe+ nsXs _ 200(25)+250(10)+300(15) _ 12000 _ |, 


ni t+ nz+ ng 200 + 250 + 300 © 50 
Combined Variance 

mi[ $2 (KX, y]+ wal Sit (%-X,)']+ tut Sot (%-x,)'] 

mit+ne+ ng 
200 [9+ (25-16) 2] + 250 [16+(10-—16 2] + 300 [25 + (15-16)? 
200 + 250 + 300 
18000 + 13000 + 7800 38800 
ee  __— 38800 _ 
750 750 = 51.7338 

Combined standard deviation = V51.733 = 7,193 


; _ Combined standard deviation’ _ 7193 ris 
Combined C.V, = ConnaetANE x 100 = 16 100 = 44, 
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an 


4.14. MOMENTS 
Moments are defined as the mean of the different powers of the deviations of 
the observations taken from their mean. For sample data, the first four moments 


about the mean X are defined as 
- Xx) : -~xp 
i ah ice Dg Garages i= a(X- X* 


n n 
- x= xX» | (x- X) 
m3= " m= n . 


For a frequency distribution the first four moments about the mean X are defined as 
ie w= Ey 
= BOSD) © ocatways) m= Ka? 


ge Zf if 
_ Ef(X- X)? Bf(X- X)4 
m= Sf m= xf . 


The terms m,, m,, m, and m, are called first, second, third and fourth 
moments about the mean. They are also called the central moments or the mean 
moments. Value of the first moment m, is always equal to zero. The second moment 
m, = S? (Variance) and is always positive. The third moment m, can be negative, 
zero or positive. For a frequency distribution, when 

m, = negative, the distribution has negative skewness, 
m,= 0, the distribution is symmetrical, 
and if m,= positive, the distribution is positively skewed. 


In general, the’rth moment about the mean X is defined as 


= 2E(K Re 
ware rf 
- xy 
Ifwe put r=0, we get mp = = x5 xX = 1. Thus my is always equal to 1. 


The moments about the mean are calculated to describe different 
characteristics of the distribution. Moments of the order higher than four can also 
be calculated but the second, third and fourth moments are the most important. 
With the help of these moments, we can find the spread and shape of the 
distribution. 

For population data, the moments for a frequency distribution are defined as: 

rEx- rf(X-p)? 


= 7 = 0 (always) B= Tf =o 
xrf(X-p)3 tf(X—-p)* pix 
y, = Bee p= ESE wheren = 
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4.14.1. MOMENT RATIOS 


Some ratios are calculated with the help of moments about the mean, For 
sample data, the important ratios are b, and b, where, 


ue a and b= = 
3 mi 


bi is called moment coefficient of skewness and b: is called moment coefficient 
of kurtosis, a term which will be explained later. The value of b, is either zero or 
positive. Its value cannot be negative. When b: = 0, the distribution is symmetrical 
when bi > 0, the distribution is either positively or negatively skewed. 

Thus b, measures the magnitude of skewness. The direction of skewness is 
not determined by bi. This direction is determined by the algebraic sign of ms. The 
corresponding ratios for the population data are f, ( beta one) and , (beta two), where 

2 


itt 
Ca | and fz = 


4.14.2. RAW MOMENTS 
The moments about the mean X are calculated by taking deviations of the 


observations from the mean X. Sometimes the mean X is not a whole number or the 


= 4 
mean contains many decimal places. Suppose X = mt = 400 = 66.66667. When 


it is rounded off up to two places of decimals, we write X= 66.67. Now this value 
of X is approximate and any calculation based on this value will have some error. If 


we write the value of X, say, up to four places of decimals, the computational work 
requires a lot of time and labour. We can overcome this problem by calculating 
moments called raw moments. First we calculate the raw moments and then these 
raw moments are used to calculate the moments about the mean. 

Raw moments are of different types. For sample data, the first four raw 
moments are denoted by m},m3, m3, and m4, For population data, the first four raw 


moments are denoted by pj, 1g, Hy, and p14. 
4.14.8. RAW MOMENTS ABOUT THE ARBITRARY ORIGIN 

Suppose we take some item of the given data and denote it by ‘A’. It is called 
an arbitrary origin or the provisional mean, The arbitrary origin is taken 
somewhere in the middle of the given data. The deviations of the observations are 
taken from the origin ‘A’. Let the deviations be denoted by D where D= X-A. The 
first four raw moments about the origin at X = A are given by 


is SED ,_ EfD? ivf aE DE tir) DS 
a ees ie Sega Wwe RNS rina ovat 
: cSt Bs 1, 2fDt 
In general, the rth moment about the arbitrary origin is written as m,= “Sp— 
rfD° 


If we put r=0,weget mg = = 1. Thus mp is always equal to 1. 
0 Ef ‘0 
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4.14.4. | RELATION BETWEEN THE MEAN MOMENTS AND THE RAW 
MOMENTS si 


The moments about the mean can be calculated from the raw moments about 
the arbitrary origin by using the following relations : 


my = 0 

nh= m,- (m1)? 

ms = m3—3mj m3+ 2(mi)* 

m,= mj— 4mj mj + 6(m})’ mj—3 (m})* 
4.14.5,RAW MOMENTS ABOUT X=0 


When the arbitrary origin ‘A’ is equal to zero, we have X—A= X. Thus the 
raw moments about X = 0 are given by 


5 SEK es Fey 3b) (eee pe) EKA 
MS SA UM Slieae co a rage oot 
EfXr 


In general, the rth moment about the origin is defined as m, = Tr: 


When the magnitude of the given observations is small, it is convenient to 
calculate the set of moments called raw moments about zero. With the help of 
these moments, the central moments can be calculated by using the relations given 
earlier. 


4.14.6. RAW MOMENTS IN CLASS INTERVAL UNITS 


X-A 
Let u= “+ Where ‘A’is an arbitrary origin and ‘h’ is the class interval, 


The raw moments based on the variable ‘u’, are called the raw moments in the class 
interval units. These are defined as : 


? SS fa fit 25 PAD ie ' rfut 
2s sold JER Pais Sar Pai use Mash 6 SE 
‘ : x rea, Zfur 
In general, the rth moment in terms of class intervals units ism) = TE 


It is very easy to calculate these moments, With the help of these moments, 
the moments about the mean can be calculated by using the relations : 


m,= 0 
= [m; - (mp"}u# 


= [mj — 3m; m) + 2(m‘)* Jn? 


# 
I 


5 
fl 


m,= [m- 4m{ m+ 6(m’)" 3-3 (mi)‘Jht 
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The moments about the mean can be calculated by using any set of raw 
moments. For grouped data, where class interval is uniform, the raw moments 
about, class interval units are the easiest in calculations, 


4.14.7 SHEPPARD'S CORRECTION 


In case of the grouped data, the mid-points of the classes are calculated. 
These mid-points are denoted by X. In the calculation of the moments, it is assumed 
that the frequency of a certain class is concentrated on the centre or the mid-point of 
the class. This assumption is not correct. In fact, the frequency is distributed 
uniformly in the respective class. Because of this assumption, certain errors are | 
introduced in the calculations of the central moments. Sheppard suggested certain 
corrections in the moments. The corrected central moments are given below: 
2 
mz (corrected) = m2— & 


mas (corrected) = my 


W 


7 

-_— —hi 
ma (corrected) = my Q™ + 240 h 
where h is the uniform class interval of the grouped data. If the class interval is not 
uniform for all the groups, the Sheppard’s corrections are not applicable. These 
corrections are also not applicable for highly skewed grouped data. 


4,15. SYMMETRY 


A distribution is called symmetrical if its frequency curve has the same shape 
on both sides of the central line which divides the curve into two equal halves. It has 
a hump (O) in the centre and the curve spreads equally on both sides. We say that it 
has equal tails on both sides. Ina perfectly symmetrical distribution, the values of 
the mean, median and mode are equal and the two quartiles are at equal distance 
from the median. Fig.4.1.(a), Fig.4.1.(b) and Fig.4.1.(c) show symmetrical curves 
which have different standard deviations. In Fig.4.1.(a), the standard deviation is 
small, in Fig.4.1.(b), it is large and in Fig.4.1.(c), it has a moderate value, 


Qi Mean Qs Mean Mean 
Median Median Median 
Mode Mode Mode 
Fig, 4.1(a) Fig. 4.1(b) Fig, 4.1(c) 


Following is a distribution which is symmetrical having the same values of the 
mean, median and mode. 


eink iii cts : 0-10 | 10-20 | 20-20 | 0-40 | «0-0 | 
een fe f= fefa |= 


In this data, mean =, median = mode = 25. 
4.16. SKEWNESS 

A distribution is called skewed if it is not symmetrical. A skewed distribution 
has a curve with a longer tail on any direction. If the longer tail is on the right side, 
the distribution is called positively skewed. If the longer tail is on the left side, the 
distribution is called negatively skewed. In case of positive skewness, the mean is 
greater than mode and for negative skewness, the mean is less than mode. In both: 
cases the median lies between the mean and the mode. Fig.4.2.(a) pane positive 
and Fig.4.2.(b) shows negative skewness. 


Mean > Median > Mode Mean < Median < Mode 
positive skewness negative skewness 


Fig.4.2(a) Fig.4.2(b) 
For a slightly skewed distribution the distance between mean and mode is 
approximately three times of the distance between the mean and the median. 
Thus Mean-— Mode = 3 ( Mean — Median ) 
This is called empirical relation between the mean, median and mode. 
4.16.1, MEASURES OF SKEWNESS 


The difference between mean and mode is an absolute measure of skewness. 
Greater the difference between the mean and the mode, greater the skewness. The 
difference between the mean and the mode is standardised and we get what is called 
relative measure of skewness or coefficient -* skewness. 
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@ 
qi) 


ii) 


Gv) 


Karl Pearson suggested two measures of skewness which are defined below: 
Coefficient of skewness = Mean —Mode 

Sometimes the mode is not clearly located in the data with the result that it 
cannot be calculated. In such cases the median is calculated and the formula 
used is: 


F 3( Mean — Median ) 
Coefficient of skewness = 3(Mean zhodian 


4 

The above two, formulas are designed in a manner such that the coefficient of 

skewness is zero for a symmetrical distribution, it is positive for a positively 

skewed distribution and negative for a negatively skewed distribution. 

Another measure of skewness was suggested by Bowley, and after his name it 

is called the Bowley's coefficient of skewness. The formula is 

(Qs — Median ) — ( Median — Q:) 

(Qs — Median) + ( Median — Q:) 
Qs-Qi 


Bowley's Coefficient of skewness = 


Its value lies between — 1 and + 1. 

This formula compares the difference between Q, and median and the 
difference between median and Q,. If these two differences are equal, the 
distribution is symmetrical and the coefficient of skewness will be zero. 

If(Q, — Median) > (Median-Q,), the distribution has positive skewness 
If(Q,— Median) < (Median-Q,), _ the distribution has negative skewness. 

In our country the distribution of agricultural land among farmers is positively 
skewed. It has a very long tail on the right side. There are very large number of 
farmers who has the land less than 5 acres and there are very small number of 
farmers who has the land more than 500 acres. 

Karl Pearson has also given another measure of skewness based on moment 
ratios. The formula is 


Coefficient of Skewness = 2( 5B2 — 3 3 9) 


4.17. KURTOSIS 


The term kurtosis is used for peakedness of the frequency curve. A curve 


which is tall and lean at the top is called leptokurtic. A curve which is flat like a 
plate at the top is called platykurtic. A curve with a moderate peak is called normal 
or mesokurtic. For a normal curve the value of B, = 3. When f, > 3, the curve is 
called leptokurtic. When B, < 3, the curve is called platykurtic. For a large sample 


data, the moment ratios b, = ae can be calculated. For a normal curve b, = 3. 
‘2 


Kurtosis is a property of the large sample or a population. For small samples, b, 
should not be calculated. 
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Leptokurtic 


Mesokurtic 


Platykurtic 


Example 4.18, 


Fig. 4.3. 


Calculate the first four moments about the mean for the following observations: 


| 


8, 10, 15, 22, 26, 30, 35, 40, 48. 


Solution. 
The necessary calculations are given below: 
fo coy 
x Pears Lalas x) (x- xy | 
ii 
8 -—18 104976 
10 -16 65536 
15 -11 14641 
22 -4 256 
26 0 0 
30 4 256 
35 9 6561 
38416 
234256 
=X X(x- x)" 
= 234 = 464898 
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177 
Moments about the mean X are: 
we, -~ xy 
mee HME dard. 5 m= X= XP da94 | ih 
n 9 n 9 
~ X)s -~xy 4898 
mg = POR KY 2862 aig | mg= X= RY _sesnon 
n 9 n 
Example 4,19 


Calculate the first four moments about the mean for the following data. Also 
calculate bi and ba, 


Solution: 


Che necessary calculations are given below: 
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Example 4.20 
The following summations have been obtained by taking arbitrary origin X = 9. 
=f = 165, <fD = 216, XfD? = 1584, XfD3 = 3168, EfD* = 33600 


Compute the first four moments about the arbitrary origin X = 9 and convert 
them into the moments about the mean. Also calculate bi and bz. 


Solution: 
Moments about X = 9 are: 


,_ 2fD 216 _ , 2D? 1584 

1 egg oe Mss sf epee 

| _ E€D3 _ 3168 _ , BED! _ 33600 _ 
m3 = “yp = q65 = 19.2 m, = “sp = 165 = 203.64 


Moments about the mean are: 
mi = mj-mj = 0 
m2 = mg — (mj)? = 9.6 - (1.31)? = 7.88 
ms = mj —3mj{mj + 2(mi)® = 19.2 - 3(1.31)(9.6) + 2(1.31)3 = — 14.03 
(since ms is negative, the distribution is negatively skewed ). 
m4 = mj—- 4mjm; + 6(m})? m3 - 3(m))* 
= 203.64 — 4(1.31) (19.2) + 6(1.31)? (9.6) — 3(1.31)4 = 193.04 


2 
_ m3 (- 14.03 _ _ mg _ 193.04 _ 
be m= (8 = 0.40 and br = 13 = (7 gg = 3.11 
Example 4.21 - , 
Given the following information: 


Lf = 290, ZfX = 2610, EfX? = 23780, LfX*= 219530, LfX*= 2056100 
Calculate first four moments about the arithmetic mean. 


Solution: 
Moments about X = 0 are: | 
, _ EEX _ 2610 , _ EfX? _ 23780 
my" ieee 2001, ae ae ye, 90k: 
=f£X8 219530 Z£X* 2056100 


ms = "pp = 299 = 787 m= “Sp = gqq = 7090 
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Moments about the arithmetic mean are: 
mi = mj-mj = 0 
mz =m} - (mj)? = 82-(9)? = 1 
ma = m3 — 3mjm, + 2(m})? = 757 - 3(9) (82) + 2(9)= 757-2214+1458 =1 ° 
ma = m4—- 4mm, + 6(m})? my - 3(mj)4 
7090 — 4(9)(757) + 6(9)#(82) - 3(9)4 = 7090 — 27252 + 39852 - 19683 = 7 
Example 4.22. 


A survey was conducted by a manufacturing company to enquire the maximum 
price at which persons would be willing to buy their product. The following table 
gives the stated price (in rupees) by persons. 


i] 


120-130 


No. of persons 


Calculate Karl Pearson's coefficient of skewness and interpret its value. 
Solution: 


The necessary calculations are given below: 


A=105,h=10 


f = (432%) 
alee a0) 


Price (in Rs.) 


80-90 
90-100 
100-110 
110-120 
120 — 130 


= Zfu x aS) @ Fe 
Mean = A+Spxh = 105+ Too * 10 = 105+0.6 = 105.6 
Mode = I+ — xh 


(fm — f1) + (fm — f2) 
where /= 90, f, = 29, f,= 11, f,=18 and h=10 


29-11 
Thus, Mode = 90+ (29-11) + (29 18) * 10 = 90+6.21 = 96.21 
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Zfu2 i: 160 (6 \ 
S= -(2 fy eh= Aaa -(35) x10 = 12.635 


Sy, 
Mean~Mode 105.6—96.21 


Coefficient of skewness = Ss = 12.635 = 0.743 


There is a high degree of positive skewness. 
Example 4.23 


Compute the Bowley's coefficient of skewness and interpret its value for the 
data given below: 


[Groups __|_40-50 | 50-60 | 60-70 | 70-80 | 80-90 | 
[Frequency | 12] 5 | 6 | te 
Solution: 
The necessary calculations are given below: 


| Groups | 40-50 | 50-60 | 60-70 | 70-80 | 80-90 | 
f 12 15 16 15 sade S| 


[cf 


Median = Value of( 2) th item = Value of( 2) th item = 35th item 
Median lies in the class 60 — 70. 


Median = 1+3(3- °)= 60+ 32 (35 - 27) = 60+5 = 65 


Qi = Value of(2) th item = Value ot(%) thitem = 17.5thitem 


Q: lies in the class 50 - 60. 


q = 14+8(8- o= 50+ 42 (17.5 -12)= 50 + 3.67 = 53.67 


Qs = Value of (22 2) thitem = Value of(2 x10) thitem = 52.5th item 


Qs lies in the class 70 — 80. 


4 


Qs = 1+B( 2-0) = 70+ 10 (s25- 43) = 70 + 6.33 = 76.33 
_ Qs+Qi-2 median 


Bowley's coefficient of skewness = arern 


_ 16.33 + 53.67 — 2(65) 
ri 76.33 — 53.67 


Hence the distribution is symmetrical. 
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Example 4.24, 


(i) Ina frequency distribution the coefficient of skewness is 0.6. If the sum of the 
upper and the lower quartiles is 100 and the median is 38, find the value of the 
upper quartile. 

(ii) For a moderately skewed distribution the mean price is Rs.20 and the median 
price is Rs.17. If the coefficient of variation is 20 %, find the coefficient of 
skewness of the distribution. 

(iii) In a certain distribution the following results were obtained: 

Mean = 45,Median = 48, Coefficient of skewness = — 0.4. Find the value of 
the standard deviation. 

(iv) Coefficient of skewness of a di8tribution is 0.64, its mean is 82 and mode is 50. 
Find the coefficient of variation. 

Solution: 

(i) Coefficient of skewness = 4 Os 8 eee 
Here, Coefficient of skewness = 0.6, Q, + Q, = 100, Median = 38, Q, = 100-Q, 

_ _100-2(38)_ _ 24 
Therefore, 0.6= Q:—100+Qs~ 2Qs-100 
0,6(2Q,-100) = 24 or 1.2Q,—60=24 or 1.2Q,=24+60=84 or 
84 
io) eieeen RG yes eee tu 

(ii) Coefficient of variation (C.V.) = isan: 100 
Here, Mean = 20,Median = 17, Coefficient of Variation = 20 % 
Therefore, 20 = ax 100 or 5S = 200r S = 2 = 4 

i ( — Median ) - 17) 
Coefficient of skewness = 3(Mean 5 odin) _ 3 20 ri 12) _ 2.25 

so as . F a ( bead i 

(iii) Coefficient of skewness = 3 Mean = Median 
Here, Mean = 45, Median = 48, Coefficient of skewness = —0.4 

_ 3445-48) 9 Bite Oea tii 
Therefore, -0.4 = 5 ies Shh sO S= ean 22.5 
(iv) Coefficient of skewness = Mean — Mode 5 Mode 
Here, Mean=82, Mode=50, Coefficient of skewness = 0.64 
82-50 _ 32 fe Nebr 
Therefore, 0.64 = 5 aig vor S = 0.64 = 50 
? brea 5 
Coefficient of Variation (C.V.) = s x100 = = x 100 = 60.98 % 
Xx 
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Basle Statistics Part-1 
SHORT DEFINITIONS Ze-"” 

Dispersion or Variation 


The degree to which a set of values vary about their mean. 
or 
The spread or variability of scores about the measure of central tendency. 
Measure of Dispersion 
_-A measure of dispersion indicates how close or far apart the values are from the 
“mean or other measure of central tendencys pe 
or . 
A measure of dispersion describes the spread or scatter of the individual yalues 
around the central position. 
>» (Absolute Measure of Dispersion 
An absolute measure of dispersion is that which measures the variation present 
between the observations in the unit of the variable, 
Relative Measure of Dispersion ‘ 
A relative measure of dispersion is that which measures the vaviation present . 
between the observations relative to their average. It is used to compare the 
variation between two or more sets of data. " 
Range 
cThe range is the difference between the largest and the smallest value in a set of 
observations. 
or 
The range of a data set is equal to the largest measurement minus the smallest 
measurement. z 
“Anterquartile Range 
The interquartile range is obtained by subtracting the score of the first quartile from 
the score of the third quartile. 
or 
The interquartile range is the distance between the lower and upper quartiles. 
_- Quartile Deviation or Semi-Interquartile Range 
Quartile deviation is half of the difference between upper and lower quartiles. 
or 
Interquartile range divided by 2, is called the quartile deviation. 
Mean Deviation 
The arithmetic mean of the absolute values of the deviations from the arithmetic 
mean is known as mean deviation, 
or 
Sum of the deviation of each score from the mean, without regard to sign, divided by 
the number of scores is called mean deviation. 
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Variance | 
The variance is defined as the average squared deviation from the mean. 
or 
Sum of the squared deviations from the mean, divided by n, is called variance. 
Standard Deviation! 
The positive square root of the average of the squared deviations from the mean is 
called standard deviation.) 
, or 
The positive square root of the sum of the squared deviations from the mean divided 
by n, is called standard deviation. 
ySefficient of Variation 
Coefficient of variation.is defined as the standard deviation of a distribution divided 
by the mean of the distribution and multiplied by 100. 
or 
The ratio of the standard deviation to the arithmetic mean expressed as a 
percentage is known as coefficient of variation. 
Empirical Rule 
A rule that states the percentages of items that are within one, two, and three 
standard deviations from the mean for mound-shaped, or bell-shaped, distributions. 
‘Symmetrical Distribution 
& distribution is called symmetrical if its frequency curve has the same shape on 
both sides of the central line which divides the curve into two equal parts. In a 
perfectly symmetrical distribution, the values of the mean, median and mode are 
equal and the two quartiles are at equal distance from the median. 
‘Skewed Distribution 
A distribution in which the observations are concentrated at one end (tail) of the 
distribution, so that the most typical scores are relatively low (skewed to the right) 
or relatively high (skewed to the left). 
(-k6ptokurtic Distribution 
A bell-shaped distribution characterized by a piling up of scores in the center of the 
distribution. 
(Ae sokurtic Distribution 
Bell-shaped distribution; “ideal” form of normal curve. 
4 Matykurtic Distribution 
A frequency distribution characterized by a flattening in the central position. 


MULTIPLE - CHOICE QUESTIONS 


1. The scatter in a series of values about the ayerage is called: 
(a) central tendency dispersion 
(c) skewness (d) symmetry 
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a 


10, 


13, 


The measurements of spread or scatter of the individual values around the 
central position is called: 
ey measures of dispersion (b) _ measures of central tendency 
(c) measures of skewness (d) measures of kurtosis 
The measures used to calculate the variation present among the observations 
in the unit of the variable is called: 
(a) relative measures of dispersion (b) coefficient of skewness 
(co) absolute measures of dispersion (d) coefficient of variation 

The measures used to calculate the variation present among the observations 
relative to their average is called: 
(a) coefficient of kurtosis (b) absolute measures of dispersion 
(c) quartile deviation (d)~ relative measures of dispersion 
The degree to which numerical data tend to spread about an average value is 
called: 


(a) constant (b) flatness 
( variation (d) skewness 
The measures of dispersion can never be: 

(a) positive (6) zero 

(Q) negative : (d) equal to 2 


If all the scores on examination cluster around the mean, the dispersion is said 
to be: 


(a)~ small (b) large 

(c) normal (d) symmetrical 

If there are many extreme scores on an examination, the dispersion is: 
(ay large (b) small 

(c) normal (d) symmetrical 


Given below the four sets of observations. Which set has the minimum 
variation? 


(a) 46, 48, 50, 52, 54 (b) 30, 40, 50, 60, 70 

(c) 40, 50, 60, 70, 80 @ 48, 49, 50, 51, 52. 

Which of the following is an absolute measures of disp: sion? 

(a) Coefficient of variation (b) Coefficient of dispersion 

(& Standard deviation (d) Coefficient of skewness 

The measure of dispersion which utilizes only extreme values is called: 
<>KG) v TOSRM. Ie Saale (b) median 

(oY range (d) coefficient of variation 

The measure of dispersion which uses only two observations is called: 

( range (b) quartile deviation 

(c) mean deviation (d) standard deviation 


In quality control of manufactured items, the most commonly used measure of 
dispersion is: 

range (b) average deviation 
(c) standard deviation (d) quartile deviation 
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14, 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


28. 


24. 


25. 


The range of the scores 29, 3, 143, 27, 99 is: 

(ay 140 (b) 143 

(c) 146 (d) 70 

If the observations of a variable X are - 4, - 20, — 30, — 44 and — 36, then the 
value of the range will be: : 


(a) -48 @y 40 


() -40 (d) 48 
The range of the values — 5, — 8, ~ 10, 0, 6, 10 is: 
(= <0 (b) 10 
@ -10 ay 20 


If Y = aX + b, where a and b are any two numbers and a 0, then the range of 
Y values will be: 


(a) range (X) (b) arange (X) +b 
(©) arange (X) —b (@ |al range (X) 
If the maximum value in a series is 25 and its range is 15, the minimum value 
of the series is: 
10 (b) 15 
(©) 25 (d) 36 
Half of the difference between upper and lower quartiles is called: 
(a) interquartile range quartile deviation 
(c) mean deviation (d) standard deviation 
If Qs = 20 and Q: = 10, the coefficient of quartile deviation is: 
(a) 3 1! 3 
(©) 2/3 (@) 1 
Which measure of dispersion can be computed in case of open-end classes? 
(a) standard deviation (b) range 
oe quartile deviation (a) coefficient of variation 


If Y = aX + b, where a and b are any two constants and a + 0, then the quartile 
deviation of Y values is equal to: 


(a) aQ.D(X) +b O lal QDX) 

(c) Q.D(X) -b (a) |b/Q.D(X) 

The sum of absolute deviations is minimum if these deviations are taken from the: 
(a) mean (b) mode 

( media. (d) upper quartile 

The mean deviation is minimum when deviations are taken from: 

(a) mean (b) mode 

( median (d) zero 


If Y = aX + b, where a and b are any two numbers but a # 0, then M.D(Y) is 
equal to: 

(a) M.D(X) (b) M.D(X) +b 

(% \al M.D(X) (a) M.D(Y) + M.D(X) 
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26. 
27, 
28. 


29. 
30. 
31. 


32. 
33. 


34. 
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The mean deviation of the scores 12, 15, 18 is: 

(a) 6 (b) 0 

© 3 ( 2 

Mean deviation computed from a set of data is always: 

(a) negative (b) equal to standard deviation 

(© more than standard deviation (® less than standard deviation 

The average of squared deviations from mean is called: 

(a) mean deviation (by variance 

(c) standard deviation : (ad) coefficient of variation 

The sum of squares of the deviations is minimum, when deviations are taken 
from: 

(a) mean (b) mode 

(c) median (d) zero 

Which of the following measures of dispersion is expressed in the same units as 
the units of observation? 

(a) Variance (b) Standard deviation 

©) — Coefficient of variation (@) Coefficient of standard deviation 
Which measure of dispersion has a different unit other than the unit of 
measurement of values: 


(a) range (b) standard deviation 
(c) variance (a) mean deviation 
Which of the following is a unit free quantity: 

(a) range (b) standard deviation 
(c) coefficient of variation (d) arithmetic mean 
If the dispersion is small, the standard deviation is: 

(a) large (b) zero 

(c) small (d) negative 

The value of standard deviation changes by a change of: 

(a) origin (b) scale 

(c) algebraic signs (a) none 


35. The standard deviation of a distribution divided by the mean of the distribution 


36. 


37. 


and expressing in percentage is called: 

(a) coefficient of standard deviation (b) coefficient of skewness 

(c) coefficient of quartile deviation (d) coefficient of variation 

The positive square root of the mean of the squares of the deviations of 
observations from their mean is called: 


(a) variance (b) range 

(c) standard deviation (d) coefficient of variation 
The variance is zero only if all observations are the: 

(a) different (b) square 


(c) square root (d) same 
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38. 


39. 


40, 


41, 


42, 


43. 


The standard deviation is independent of: 
(a) change of origin (b) change of scale of measurement 
(c) change of origin and scale of measurement (d) difficult to tell 


If there are ten values each equal to 10, then standard deviation of these 
values is: 


(a) 100 (b) 20 

(ce) 10 (@) 0 

If X and Y are independent random variables, then S.D.( X + Y ) is equal to: 
(a) S.D(X) +S8.D(Y) (b) Var(X) + Var(Y) 


(©) \War(X) + Var(¥) (@)~VVar(X) + Var(¥) 

S.D.(X) = 6 and S.D.(Y) = 8. If X and Y are independent random variables, 
then S.D.(X - Y ) is: 

(a) 2 (b) 10 

( 14 (d) 100 

For two independent variables X and Y if S.D.(X) = 1 and S.D.(Y) = 3, then 
Var (3X - Y ) is equal to: 

(a) 0 (b) 6 

(c) 18 (d) 12 

If Y = aX + b, where a and b are any two constants and a + 0, then Var (Y) is 
equal to: 


(a) a Var(X) (b) a Var(X) +b 
(ce) a?Var(X) -b (d) a? Var(X) 
44, If Y =aX + b, where a and b are any two numbers but a #0, then S.D.(Y) is equal to: 
(a) S.D(X) (b) a.S.D(X) 
() |alS.D(X) (a) aS.D(X) +b 


45. 


46. 


The ratio of the standard deviation to the arithmetic mean expressed as a 
percentage is called: 


(a) coefficient of standard deviation (b) coefficient of skewness 
(©) coefficient of kurtosis (a) coefficient of variation 
Which of the following statements is correct: 

(a) The standard deviation of a constant is equal to unity 


(b) The sum of absolute deviations is minimum if these deviations are taken 
from the mean. 


(c) The second moment about origin equals variance 


(d) The variance is positive quantity and is expressed in square of the units 
of the observations. 
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47. Which of the following statements is false? 


(a) The standard deviation is independent of change of origin 

(b) If the moment coefficient of kurtosis B2 = 3, the distribution is meokurtic 
or normal. 

(c) Ifthe frequency curve has the same shape on both sides of the central line 
which divides the curve into two equal parts, is called a symmetrical 
distribution. 

(d) Variance of the sum or difference of any two variables is equal to the sum, 
of their respective variances. 


48. If Var(X) = 25, then S.D ( ax 5) is equal to: 
(a) 16/2 (be) 50 
(c) 25 (@) 6 
49. To compare the variation of two or more than two series, we use 
(a) combined standard deviation (b) corrected standard deviation 
(c) coefficient of variation (d) coefficient of skewness 
50. The standard deviation of -— 5, — 5, - 5, — 5, — 5 is: 
(a) -5 (b) +5 
(c) 0 (d) -25 
51. Standard deviation is always calculated from: 
(a) mean (b) median 
(c) mode (d) lower quartile 
52. The mean of an examination is 69, the median is 68, the mode is 67, and the 
standard deviation is 3. The measures of variation for this examination is: 
(a) 67 (b) 68 
() 69 (@) 3 
53, The variance of 19, 21, 23, 25 and 27 is 8. The variance of 14, 16, 18, 20 and 22 is: 
(a) greater than 8. (b) 8 
(c) less than 8 (d) 8-5=3 
54, Ina set of observations the variance is 50. All the observations are increased by 
100 %. The variance of the increased observations will become: 
(a) 50 (b) 200 
(c) 100 (d) nochange. 


55. 


Three factories A, B, C have 100, 200 and 300 workers respectively. The mean 
of the wages is the same in the three factories. Which of the following 
statements is true? 

(a) There is greater variation in factory C. 

(b) Standard deviation in factory A is the smallest. 

(c) Standard deviation in all the three factories are equal. 

(d) None of the above. 
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56. 


67. 


58. 


59. 


60, 


61. 


62. 


63. 


64, 


65. 


66. 


An automobile manufacturer obtains data concerning the sales of six of its 
deals in the last week of 1996. The results indicate that the standard deviation 
of their sales equals 6 autos. If this is so, the variance of their sales equals: 


(a) V6 ) 6 
© a @ 36 


If standard deviation of the values 2, 4, 6, 8 is 2.236, then standard deviation of 
the values 4, 8, 12, 16 is: 


(a) 0 (b) 4.472 (c) 4,236 (d) 2.236 


Var(X) = 4 and Var(Y) = 9. If X and Y are independent random variable then 
Var(2X + Y) is: 
(a) 13 (b) 17 (c) 25 (d) -1 


If X=Re. 20, S = Rs. 10, then coefficient of variation is: 

(a) 45% (b) 50% () 60% (d) 65% 

Which of the following measures of dispersion is independent of the units 
employed? 


(a) Coefficient of variation (b) Quartile deviation 
(c) Standard deviation (d) Range 
In Sheppard’s correction 1, is equal to: 

2 2 
(a) py+ (b) ba- e 

2 

© w-% @ w,-= 
The moments about mean are called: 
(a) raw moments (b) central moments 
(c) moments about origin (d) all of the above 
The moments about origin are called: Hl 
(a) moments about zero (b) raw moments 
(c) both (a) and (b) (d) neither (a) nor (b) 
All odd order moments about mean in a symmetrical distribution are: 
(a) positive (b) negative 
(c) zero (d) three 
The second moment about arithmetic mean is 16, the standard deviation will be: 
(a) 16 (b) 4 @) 2 (d) 0 


The first and second moments about arbitrary constant are - 2 and 13 
respectively. The standard deviation will be: 
(a) -2 (b+) 3 
(c) 9 (d) 13 
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67. Moment ratios B: and fz are: 
(a) independent of origin and scale of measurement 
(b) expressed in original unit of the data 


(c) unitless quantities (d) both (a) and (c) 
68. The first moment about X=0 ofa distribution is 12.08. The mean is: 
(a) 10.80 (b) 10.08 
(c) 12.08 » (d) 12.88 
69. First two moments about the value 2 of a variable are 1 and 16. The variance 
will be: 
(a) 13 (b) 15 
(c) 16 (d) difficult to tell. 


70. The first three moments of a distribution about the mean X are 0, 4 and 0. The 
distribution is: 


(a) symmetrical. (b) skewed to the left. 
(c) skewed to the right. (d) normal 

71. Ifthe third central moment is negative, the distribution will be: 
(a) symmetrical (b) positively skewed 
(c) negatively skewed (d) normal 

72. Ifthe third moment about mean is zero, then the distribution is: 
(a) positively skewed (b) negatively skewed 
(c) symmetrical (d) mesokurtic 

73. Departure from symmetry is called: 
(a) second moment (b) kurtosis 
(c) skewness (d) variation 

74. In asymmetrical distribution, the coefficient of skewness will be: 
(a) 0 (b) Qu 
() Qs (d) 1 

75. The lack of uniformity or symmetry is called: 
(a) skewness (b) dispersion 
(c) kurtosis (d) standard deviation 

76. For a positively skewed distribution, mean is always: 
(a) less than the median. (b) less than the mode. 
(c) greater than the mode. (d) difficult to tell. 

77. Yor a symmetrical distribution: 
(a) Bi>0 () B,<0 
(©) Bi =0. (d) B,=3. 

78. Ifmean = 50, mode = 40 and standard deviation = 5, the distribution is: 
(a) positively skewed (b) negatively skewed 


(c) symmetrical (A) difficult to tell 
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79, 


80. 


81. 


82. 


83. 


84, 


85. 


86. 


87, 


88. 


89. 


90, 


91. 


If mean = 25, median = 30 and standard deviation = 15, the distribution will be: 
(a) symmetrical (b) positively skewed 

(c) negatively skewed (d) normal 

If mean = 20, mode = 16 and standard deviation = 2, then coefficient of 
skewness is: 

(a) 1 (b) 2 (c) 4 (da) -2 

If mean = 10, median = 8 and standard deviation = 6, then coefficient of 
skewness is: 


(a) 1 (b) -1 (c) 2/6 (d) 2 

If the sum of deviations from median is not zero, then a distribution will be: 
(a) symmetrical (b) skewed 

(c) normal (d) all of the above 

In case of positively skewed distribution, the extreme values lie in the: 
(a) middle (b) left tail 

(c) right tail (d) anywhere 

Bowley’s coefficient of skewness lies between: 

(a) Oand1 (b) -land+1 

(ec) -lando (d) -2and+2 

In a symmetrical distribution, Qs— Qi= 20, median = 15. Qs is equal to: 
(a) 5 (b) 15 (©) 20 (d) 25 


Which of the following is correct in a negatively skewed distribution? 

(a) The arithmetic mean is greater than the mode, 

(b) The arithmetic mean is greater than the median. 

(c) (Q3;— median ) = (median — Q1).@ (Q:- median ) < ( median — Q,). 
The lower and upper quartiles of a distribution are 80 and 120 respectively, 
while median is 100. The shape of the distribution is: 


(a) positively skewed. (b) negatively skewed. 

(c) symmetrical. (d) normal 

In a symmetrical distribution Q: = 20 and median = 30. The value of Q, is: 
(a) 50 (b) 35 (c) 40 (d) 25 

The degree of peakedness or flatness of a unimodal distribution is called: 
(a) skewness (b) symmetry 

(c) dispersion (d) kurtosis 


For a leptokurtic distribution, the relation between second and fourth central 
moment is: 


2 

@) p,=3.2 () Hy <8yp 

© n> 3.3 @) w= 

For a platykurtic distribution, the relation between HM, and pL, is: 
2 

@) 1, <3. () 3Hp> by 


(c) both (a) and (b) @) by =p 
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92. 


93. 


94, 
95> 
96. 


97. 


For a mesokurtic distribution, the relation between fourth and second mean 
moment is: 


(@) He= 3h (>) Hy> 3H 
© e<3hy @ we/ Hy 


The second and fourth moments about mean are 4 and 48 respectively, then 
the distribution is: 


(a) leptokurtic (b)  platykurtic 

(c) mesokurtic or normal (a) positively skewed 
In a mesokurtic or normal distribution, pu = 243, The standard deviation is: 
(a) 81 (b) 27 

() 9 @ 3 

The value of B2 can be: 

(a) less than 3 (b) greater than 3 

(c) equal to 3 (ad) all of the above 
In a normal ( mesokurtic ) distribution: 

(a) Bi=Oand p2=3 (b) Bi=3 and f2=0 
(c) Bi=Oand B2>3 (d) Bi=Oand f2<3 


Any frequency distribution, the following empirical relation holds: 
(a) Quartile deviation = 2 Standard deviation 


4 
(b) Mean deviation = 5 Standard deviation 


(c) Standard deviation = : Mean deviation = 3 Quartile deviation 
(d) All of the above 


Answers 
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SHORT QUESTIONS 

1 Explain the meaning of the term dispersion or variation. 

.2 What are the qualities of a good measure of dispersion? 

8 Write down the types of dispersion. 

Explain the absolute measure of dispersion. 

5 Define relative measure of dispersion. 

6 Differentiate between absolute and relative measures of dispersion. 

.7 What are the different absolute measures of dispersion? 

8 Enlist the relative measures of dispersion. 

-9 Write a short note on range, 

Q.10 Define the range and how is it calculated for grouped data. 

Q.11 State the advantages and disadvantages of the range. 

Q.12 Given Xm = 100 and Xo = 25. Find coefficient of range. 

Ans. 0.6 

Q.13 Given upper class boundary of the highest class = 80 and lower class 
boundary of the lowest class = 20. Compute coefficient of range. 

Ans. 0.6 

Q.14 Given mid value of the highest class = 75 and mid value of the lowest 
class = 15. Determine range and coefficient of range. 

Ans. 60 and 0.67 

Q.15 Define inter-quartile range. 

Q.16 Write down the definition of quartile deviation. 

Q.17 Define semi inter-quartile range and its coefficient. 

Q.18 Given Q, = 25 and Q, = 75. Find quartile deviation, 

Ans. 25 

Q.19 Given Q, = 125 and Q; = 175. Find coefficient of quartile deviation. 

Ans. 0.17 

Q.20 In a symmetrical distribution Qi = 140 and median = 150. Find quartile 


deviation. 

Ans. 10 

Q.21 In a symmetrical distribution median = 50 and Qs = 60. Find quartile 
deviation. 

Ans. 10 

Q.22 Given quartile deviation = 11.25 and Qs = 90.745. Find coefficient of quartile 
deviation. 

Ans. 0.14 

Q.23 Given quartile deviation = 1.75 and Qi = 40.25. Find coefficient of quartile 
deviation. 

Ans. 0.0417 


Q.24 What is mean deviation and how is it calculated? 
Q.25 Define average deviation and its coefficient, 

Q.26 Given X = 2, 4, 6. Find mean deviation from median. 
Ans, 4/3 
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Q.27 


Ans, 


Q.28 
Ans. 
Q.29 


Ans. 
Q.30 


Ans. 


Q.31 


Ans. 
Q.32 
Q.33 
Q.34 
Q.35 
Q.36 
Q.37 
Q.38 
Q.39 
Q.40 
Q.41 
Q.42 
Ans. 
Q.43 
Ans. 
Q.44 
Ans. 
Q.45 
Ans. 
Q.46 
Ans. 
Q.AT 
Ans. 
Q.48 
Ans, 
Q.49 


Ans. 
Q.50 


Ans. 


Basic Statistics Part-I 
Given X = 4, 6, 8, 8, 10. Find mean deviation from mode. 
1.6 
Given n= 6, X=4 and |X- sa = 24. Find coefficient of mean deviation. 


1 

Given n = 9, mode = 7 and £|X — mode| = 23. Compute mode coefficient of 
dispersion. 

0.37 

Given median = 8, n = 4 and }|X - median| = 48. Compute coefficient of 
mean deviation from median. 

1.5 

The sum of absolute deviations from median 25 for 9 observations is 180. 
Find the coefficient of mean deviation. 

0.8 

Define the standard deviation. 

‘What is standard deviation and how is it calculated? 

Write down the various formulas of standard deviation. 

Write down the various formulas of variance. 

Define population variance. 

Explain the biased sample variance and unbiased sample variance. 

Write down the mathematical properties of standard deviation, 

Write down the mathematical properties of variance. 

Write short note on coefficient of variation. 

Define the coefficient of variation and coefficient of standard deviation. 

Find biased sample standard deviation of the scores 30, 35, 40. 

4.08 

Find unbiased sample standard deviation of the scores 30, 35, 40. 

5 

Given X = 2, 4, 6, 8, 10. Find biased sample variance. 

8 

Given X = 2, 4, 6, 8, 10. Find unbiased sample variance. 

10 

If Var( X ) = 25, then find Var( 2X + 4) and S.D.( 4 - 2X) 

100 and 10. 

Given EX= 180, S’= 36 and n=5, Find EX’. 

6660 

Given EX = 180, X* = 6660 and n = 5. Find coefficient of variation. 

16.67% 

The mean and coefficient of variation of 20 observations are 20 and 25% 
respectively. Find variance. 

25 

The mean of 200 items is 48 and their standard deviation is 3. Find the sum 
of squares of all items. 

462600 ~ 
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Q.51 


Ans. 
Q.52 


Ans. 
Q.53 


Ans, 
Q.54 


Ans. 
Q.55 


Ans. 
Q.56 
Q.57 
Q.58 
Q.59 
Q.60 
Q.61 
Q.62 
Q.63 


Q.64 


Ans. 
Q.65 


Ans. 
Q.66 


Ans. 
Q.67 


Ans, 
Q.68 
Ans. 


In a surprise checking of passengers in a local bus, 5 passengers without 
tickets were caught. The sum of squares and the standard deviation of the 
amount found in their pockets were Rs. 5000 and Rs. 10 respectively. If the 
total fine imposed is equal to the amount discovered from them, and fine 
imposed is uniform, Find amount of fine to be paid by each passenger. 

Rs. 30 

Given n = 100, 5(X - 4) =-11 and X( X - 4 )?= 257. Compute coefficient of 
variation. 

41.13% 


Given X = 20 and Var( X ) = 25. If Y = 2X — 40 then find mean and variance 
of Y. 

0 and 100 

For a set of 10 observations, the sum of the deviations from 25 is 100 and the 
sum of the squares of these deviations is 1640. Find the coefficient of 
variation. 

22.86% 

The sum of squares of deviations taken from mean 50 for 10 sample 
observations is 640. Find the coefficient of variation. 

16% 

Write down the various measures of dispersion along with their formulae. 
Explain the moments about mean. 

Discuss the relation between the mean moments and the raw moments. 
Write first four raw moments in class interval units. 

Write the formulas of first four moments about origin. 

Write down the formulas for moments about the arbitrary origin. 

What is meant by moment ratios? 

Define the moment ratios ( fi and fz ). For what purposes these are 
computed? 

The first two moments of a distribution about the value 5 of a variable are 2 
and 32. Find variance. 

28 

The first three moments of a distribution about the value X = 20 are 1, 4, 10. 
Find the value of b,. 

bi=0 

The first four moments about the arithmetic mean of a distribution are 0, 4, 6 
and 48, Find b,. d 

be=3 oy 

The first two moments of a distribution abgut X = 10 are 2 and 20. Find 
coefficient of variation. 

83.33% 

The first moment about X = 25 is 5. Find arithmetic mean. 

30 
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Given mean = 25, variance = 16 and Bi = 1. Find the third moment about 
mean. 

64 

Write the formulas of corrected central moments, 

Explain the term skewness. 

Distinguish between positive and negative skewness with diagrams. 

What is meant by skewness and how it can be calculated? 

Distinguish between symmetry and skewness, 

Write down the various formulas for the measurement of skewness, 

Can the value of mean, median and mode be the same? If yes, state the 
situation. 

Can the difference between Qs and median be equal to the difference between 
median and Qi? If yes, state the situation. 

Given mean = 100, mode = 95 and standard deviation = 10. Find coefficient of 
skewness. 

0.5 

Given mean = 50, median = 48 and standard deviation = 6, Find Karl 
Pearson's coefficient of skewness. 

1 

For a moderately skewed distribution the mean price is Rs.20, the median 
price is Rs.17 and the coefficient of variation is 20%. Find coefficient of 
skewness. 

2.25 

Given mean = 50, median = 48 and coefficient of skewness = 1. Find the value 
of the variance. 

36 . 

In a symmetrical distribution Q, = 40 and Q, = 60. Find median. 

50 

What would be the shape of the distribution if: 

(a) mean = median = mode (b) mean > median > mode 

(c) mean < median < mode (d) (Q; — median ) > ( median — Q,) 

(a) symmetrical (b) positively skewed 

(c) negatively skewed (d) Positively skewed 

Given Qs - median = 25 and median — Qi = 16. Compute Bowley’s coefficient 
of skewness. 

0.22 

Define kurtosis. 

Define leptokurtic distribution. 

Define mesokurtic distribution, 

Define platykurtic distribution. 

Explain skewness and kurtosis. 
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EXERCISES 


Q.1 Compute the range and the coefficient of range for the observations: 

(i) 2000, 35, 400, 15, 40, 1500, 300, 6, 90, 250 

(ii) 8, 11, 7, 15, 9, 12, 10,6 

(iii) 2, 3, 3, 5, 5, 5, 8, 10, 12 

(iv) X,=1, X,=10, X,=2, X, = 9, X,=3, X,=8, X,=4, X,=7, X,)=5,X,=6, 
Ans. (i) 1994, 0.994 (ii) 9, 0.429 (ii) 10,0.714 (iv) 9, 0,818. 


Ans, 8, 0.667 
Q.8 Find the range and coefficient of range from the following frequency 


distribution, 
d 
pe eee | 


No. of apples 

Ans. 120, 0.482 

Q.4 For the data given below, compute quartile deviation and coefficient of quartile 
deviation: 1950, 1870, 1870, 1775, 1745, 1720, 1740, 1670, 1710, 1590, 1030, 
1110, 1070, 1190, 1230, 1310, 1350, 1332, 1430, 1460, 

Ans, 246.875, 0.165 

Q.5 Calculate the quartile deviation and its coefficient from the following data: 


(2) 
= 
(=) 
= 
rar 
= 
to 


RollNo. [a [2]af4a]spela)s 
Marks | 39 | 41 | 41 | 40 | 40 43 | 43 | 42 | 42 | 44 | 44 | a5 


Ans. 1.75, 0.042. 

Q.6 Consider the following data: 
[Howry wages Rs) [ 40-60 | 60-60 [ oo=70 [0-50 [ww] 
[Number fpenons | 2 [ 6 | 6 [= | 2] 
Compute the quartile deviation and coefficient of quartile deviation. 

Ans: 9, 0.14 


Q.7 Calculate the quartile deviation and coefficient of quartile deviation from the 


distribution of aptitude scores, 


| No.ofappticante [7 [a2 _| 


Ans: 7.415, 0.11 
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Q.8 Compute the semi-inter-quartile range for the cholesterol level of the 20 
patients at a hospital. 


Cee Bit kiresrecti eel 


Ans: 4,25 

Q.9 Calculate the mean deviation from mean and mean deviation from median for 
the following data:2, 6, 9, 12, 8, 13, 5, 6, 23, 16. 

Ans, 4.8, 4.6 

Q.10 Calculate the mean deviation from (i) arithmetic mean, (ii) median, (iii) mode 
in respect of the marks obtained by nine students given below and show that 
the mean deviation from median is minimum. Marks (out of 25): 7, 4, 10, 9, 15, 
12, 7, 9, 7. If the marks are doubled (converted out of 50), will the variation in 
marks increase? Give reasons. 

Ans. (i) 2.35 Gi) 2.33 (iii) 2.56. 

Q.11 Calculate the mean deviation about mean from the following distribution of 
differences in ages between husbands and wives in a particular community: 


ee 


Ans: 4.024 


Q.12 For the following frequency table, calculate the mean deviation about median 
and its coefficient. 


Height in inches 


[0-6 | 


Number of boys 


Ans: 1.01, 0.02 


Q.13 Following are the monthly salaries of a sample of 30 workers in an office. 
Compute the mean deviation from the mode. Also compute the mode coefficient 


of dispersion. 


Monthly salary (in thousand Rs.) 


6 and under 8 

8 and under 10 
10 and under 12 
12 and under 14 
14 and under 16 


Number of workers 
6 
5 


10 
6 


Ans; 1.98, 0.18 
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Q.14 Calculate mean deviation with mean and median of the marks of the 80 
students. 


@=70 7080 | 0-90 
RIT? ene wr 
Ans. 14.188, 14.063 

Q.15 Ten measurements gave the following results: 

Length in cms: 77, 73, 75, 70, 72, 76, 75, 72, 74, 76, 
Find variance and standard deviation. 

Ans. 4.4, 2.098. 

Q.16 Find standard deviation and coefficient of variation of the numbers: 3.2, 4.6, 
2.8, 5.2, 4.4, 

Ans. 0.90, 22.28 % 

Q.17 Calculate the variance of 14, 16, 18, 20, 22. Without recalculation, find the : 
variance after (i) adding 5 to each observation (ii) subtracting 5 from each 
observation (iii) multiplying each observation by 2 (iv) dividing each 
observation by 2 (v) multiplying each observation by 2 and adding 5. 

Ans.8 (i) 8 Gi) 8 Git) - 32 (iv) 2 (v) 82 

Q.18 Compute the coefficient of variation of (i) X (ii) Y = 2X where X has values, 2, 3, 
3, 5, 5, 5, 8, 10, 12. Are the two results same? If so, give reason. 

Ans. (i) 54.5% (ii) - 54.5 %. 

Reason: Coefficient of variation is independent of change of scale. 

Q.19 Find the arithmetic mean and the standard deviation of the observations: 40, 
40, 50, 60, 70, 70, 80, 80, 90. Also find the arithmetic mean and the standard 
deviation after increasing the observations by (i) 10 units (ii) 10 percent. 

Ans. 64.44, 17.07 (i) 74.44, 17.07 Gi) 70.89, 18.78 

Q.20 What will be the standard deviation and the variance in each of the following 
cases? (i) 2X (ii) X +2 (iii) BX +4. If Var (X) = 

Ans. (i) 10, 100 (ii) 5, 25 ii) 10, 100. 

Q.21 Given the following information: 


n, = 160, 2(X,-100) = 180, £(X,—100)?= 245820 

n, = 200, 2(X,-100) = 250, X(X,-100)’= 43850 

Compare the variability by calculating the coefficient of variation of each 
series, 


Ans. C.V.(1) = 39.94 %, C.V.(2) = 14.57 %. 

Q.22 Rawat electronics is giving two training programmes? Two groups were trained 
for the same task. For the first group trained by programme A, it took a mean 
of 28.74 hours with a variance of 79.39. For the second group trained by 
programme B, it took a mean of 20.5 hours with a variance of 54.76. Which 
training programme has less relative variability in its performance? 

Ans. Programme A 
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Q.23 The weight measurements in grams of 20 eggs are given below: 
65, 68, 60, 76, 65, 74, 58, 56, 61, 57, 63, 64, 72, 66, 65, 67, 65, 63, 67, 67. 
Find the percentage of observations lying within the limits 


@ X4S Gi) X +28 (ii) X + 3s. 
Ans. (i) 70% (ii) 95% (iii) 100% 


Q.24A professor of statistics wants to estimate the average study time per week for 
students in statistics courses at his college. To accomplish this, he selects a 
sample of 25 students and records their weekly study times. The times, to the 


nearest hour, are: 
ps Ac) 8 fs | tole a] 


Nonterofondms | 2 [af] s[a}e | 
Compute the standard deviation. 
Ans: 1.30 


Q.25A frequency distribution on the length of telephone calls monitored at the 
switchboard of an office is given below: 


Length of calls ( minutes ) Number of calls 
5 


0 and under 2 
10 
40 
30 
15 


2 and under 4 
4 and under 6 

Compute the variance and coefficient of variation of the calling time. 

Ans: 4.16, 35.17 % 


6 and under 8 
8 and under 10 
Q.26 Calculate standard deviation taking : 
X- 22.5 


@ D=X-17.5 and (i) u = ae 


[is 20 [20~35 [e000 a 
Pe fe fe [| 
Ans. (i) 7.75 (ii) 7.75 


Q.27 Compute the variance from the following data using step deviation method. 


Ans: 45564 
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Q.28 The goals scored by two teams A and B in a football season were as follows: 


[Necfroteowdinamen | [1 ]2[es[ a] 
peach. | 
frome fw fefels[o| 


Which team should be considered as more consistent. 
Ans. Team B 


Q.29 An analysis of the daily wages paid to workers in two firms A and B belonging 
to the same industry, gives the following results. 


Find out: (i) which firm pays larger amount as daily wages? 
Gi) In which firm is there greater variability in individual wages? 
Ans. (i) Firm A (ii) Firm B. 
Q.30 You are provided with the following raw sums in a statistical investigation of 
two variables, X and Y: 
EX= 235, LY=250, <©X*=6750, ryY?=6840 
Ten pairs of values are included in the survey. Compute the standard deviation 
of the X and Y variables. 
Ans.S.D.(X) = 11.08, S.D.(Y) = 7.68 
Q.31The mean of 200 items is 48 and their standard deviation is 3. Find the sum 
and sum of squares of all these items. 
Ans.=X = 9600, =X? = 462600 
Q.32.A computer calculated mean and standard deviation from 20 observations as 
42 and 5 respectively. It was later discovered at the time of checking that it 


had copied down two values as 45 and 38, whereas the correct values were 35 
and 58. Find correct value of coefficient of variation. 

Ans. 14.65 % 

Q.33A student calculated the values of mean and standard deviation of 25 
observations as 20 and 4 respectively. It was later discovered at the time of 
checking that he had copied two values as 7 and 18 while the correct values 
were 13 and 17. Find the correct value of coefficient of variation. 

Ans. 16.68 % 


Number of matches 
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Q.34 Mean and variance of marks of section A are 62 and 16 respectively and that of 
section B are 69 and 25 respectively while the number of students are 30 and 
40. Find out combined mean. Which section has greater absolute dispersion? 
Which has greater relative dispersion? 

Ans. 66, B has greater absolute dispersion, B has greater relative dispersion. 

Q.35 Given the following data. Determine the mean and standard deviation of the 
combined set: 


Ans. 23.18, 3 ’ 
Q.36 The following are some of the particulars of the distribution of weights of boys 
and girls in a class. 


Number 


Mean weight 


Variance 


Find arithmetic mean and standard deviation of the boys and girls taken 
together. 

Ans. 55, 7.572 

Q.37A calculating machine while calculating mean and standard deviation of 25 
readings misread one observation as 36 instead of 26. The following results 
were given by the machine. S.D. = 5 and X = 30. What are the correct values of 


X and S.D.? 

Ans, 29.6, 4.9 : 

Q.38 A group has arithmetic mean of 10 from 60 items with variance 4, A subgroup 
of 40 items has mean 11 and standard deviation 1.5. Find the mean and 
standard deviation of other subgroup. 

Ans. 8, 1.225 


Q.39The mean of 5 observations is 4.8 and variance is 4.56. If three of the five 
observations are 2, 5 and 6, find the other two observations. 


Ans. 8 and 3. 

Q.40 Calculate the first four moments about the mean for the observations 
81, 87, 90, 93, 94, 95, 98, 104, 105. 

Ans. 0, 52.54, — 40.56, 6249.26 


[Chapter 4] Measures of Dispersion 203 
eee SSSSSSSSSSSSSSSSCFSCS~ OS 


Q.41 Calculate the first four moments about the mean for the following data. Also 
calculate b, and b, . 


Ans. 0, 3.04, 4.86, 35.80; 0.84, 3.87. 


Q.42 Calculate first four moments about X = 0 from the following data. Also 
calculate the moments about the mean. 


Frequency 


Ans. 12.08, 172.92, 2763.02, 47955.73; 0, 26.99, 21.98, 1964.62 
Q.43 Given the following information: 


Uf= 76, LfX = 672, VEX® = 4848, LEX® = 44240, VEX4= 425280. 
Calculate first four moments about the mean and test the distribution for 
symmetry and normality. 

Ans. 0, 7.09, — 4.99, 119.36; b,=0.07, b, = 2.37 

Q.44Find the first four moments about the mean for the data given below taking 
nis 4 ; 22 ‘ 


Ans. 0, 39,25, — 26.25, 3843.75 

Q.45 The first three moments of a distribution about the value 2 of a variable are 1, 
16 and — 40. Show that the mean is 3, variance is 15 and m3 =— 86. 

Q.46 The first three moments of a distribution about the value X = 4are 1,4 and 10. 
Is the distribution symmetrical, positively skewed or negatively skewed? 

Ans. Yes, the distribution is symmetrical. 


Q.47The first four moments of a distribution about X = 2 are 1, 2.5, 5.5 and 16. 
Calculate the mean and the coefficient of variation. 


Ans.X= 3, C.V. = 40.67% 


Q.48 The first four moments of a set of numbers about 3 are — 2, 10, - 26 and 50, 
Determine the corresponding moments about the mean. 


Ans.m=0, mz= 6, ms = 19, ma= 42 

Q.49 The first four moments about X = 17.5 of a distribution are 0.3, 74, 45 and 
12125, Find out whether the distribution is leptokurtic or platykurtic. 

Ans. platykurtic. 
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Q.50 Following are the data relating to weights of students. Calculate the coefficient 


of skewness by Karl Pearson's method. 
Weights (pounds) | 118-126 | 127-135 | 136-144 | 145 - 153 


[Numberctendens| a |e |, [| « | a1 


Ans: 0.09 


Q.51Compute the coefficient of skewness using the averages and standard 
deviation. 


[asks [0-10 | 10-20 | s0-00 | 00-00 | w00 | 
[rreaueny | « | 2 | w | s | 2 | 


Ans, — 0.05 
Q.52 Find the Bowley's coefficient of skewness for the data given below: 


[Grows | 10-19 | 20-29 [0-0 [0-0 | 0-9 | @- | 
renee | « | | «| 2] | 0 | 


Ans. 0.044 


Q.53 For each of the following unimodal distributions examine whether the 
distribution is symmetrical, skewed to the left or skewed to the right. 


@ Mean = 19, Mode = 62 
Gi) Q, = 136, Median = 160, Q,= 184 
(ii) Mean = 78, Median = 61 
(iv) Q, = 14, Median = 29, Q,= 53. 


Ans. (i) left skewed (ii) symmetrical (iii) right skewed (iv) right skewed 


Q.54From the marks secured by 120 students in section A and 120 students of 
section B of a class, the following measures are obtained. 


AM. = 47.83 | Mode = 47.07 


Describe which distribution of marks is more skewed. 


Ans. Coefficient of skewness (section A) =— 0.327. | Distribution A is more 
Coefficient of skewness ( section B ) = 0.051 skewed than distribution B, 
Q.55(i) Given: Ef = 120, LfX = 296, Mode = 2.944 and second moment about 
mean is 1.4884. Calculate coefficient of skewness. 


(ii) The lower and upper quartiles of a distribution are 142.36 and 167.73 
respectively, while median is 153.50. Find coefficient of skewness. 


Ans. (i) -0.39 (ii) 0.122 


Chapter 


INDEX NUMBERS 


5.1, INTRODUCTION 

Index numbers are meant to study the changes in the effects of such factors 
which cannot be measured directly. According to Bowley, "Index numbers are used 
to measure the changes in some quantity which we cannot observe directly". For 
example, changes in business activity in a country are not capable of direct 
measurement but it is possible to study relative changes in business activity by 
studying the variations in the values of some such factors which affect business 
activity, and which are not capable of direct measurement, 

Index numbers are commonly used statistical devices for measuring the 
combined fluctuations in a group of related variables. If we wish to compare the 
price level of consumer items today with that prevalent ten years ago, we are not 
interested in comparing the prices of only one item, but in comparing some sort of 
average price levels. We may wish to compare the present agricultural production or 
industrial production with that at the time of independence, Here again, we have to 
consider all items of production and each item may have undergone a different 
fractional increase (or even a decrease). How do we obtain a composite measure? 
This composite measure is provided by index numbers which may be defined as a 
device for combining the variations that have come in a group of related variables 
over a period of time, with a view to obtain a figure that represents the 'net' result of 
the change in the constituent variables. 

Index numbers may be classified in terms of the variables that they are 
intended to measure. In business, different groups of variables in the measurement 
of which index number techniques are commonly used are (i) price (ii) quantity 
(iii) value and (iv) business activity. Thus, we have index of wholesale prices, index 
of consumer prices, index of industrial output, index of value of exports and index of 
business activity etc. Here we shall be mainly interested in index numbers of prices 
showing changes with respect to time, although methods described can be applied to 
other cases. In general, the present level of prices is compared with the level of 
prices in the past. The present period is called the current period and some period in 
the past is called the base period. 

5.2. INDEX NUMBERS 

Index numbers are statistical measures designed to show changes in a variable 
or group of related variables with respect to time, geographic location or other 
characteristics such as income, profession, etc. A collection of index numbers for 
different years, locations, etc., is sometimes called an index series, i 
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5.3. SIMPLE INDEX NUMBER 


A simple index number is a number that measures a relative change in a single 
variable with respect to a base. 


5.4. COMPOSITE INDEX NUMBER 


A Composite index number is a number that measures an average relative 
‘change in a group of related variables with respect to a base. 


5.5. TYPES OF INDEX NUMBERS 
Following types of index numbers are usually used: 


2 


5.5.1. Price Index Numbers 


Price index numbers measure the relative changes in prices of a commodities 
between two periods. Prices can be either retail or wholesale. 


5.5.2. Quantity Index Numbers 


These index numbers are constructed to measure changes in the physical 
quantity of goods produced, consumed or sold of an item or a group of items. 


5.6. USES OF INDEX NUMBERS 
The main uses of index numbers are given below: 


(i) Index numbers are used in the fields of commerce,, meteorology, labour, 
industries etc. 


(ii) The index numbers measure fluctuations during intervals of time, group 
differences of geographical position or degree etc. 


Gii) They are used to compare the total variations in the prices of different 
commodities in which the unit of measurements differ with time and place etc. 


(iv) They measure the purchasing power of money. 
(v) They are helpful in forecasting the future economic trends. 


(vi) They are used in studying difference between the comparable categories of 
animals, persons or items. 


(vii) Index numbers of industrial production are used to measure the changes in the 
level of industrial production in the country. 


(viii) Index numbers of import prices and export prices are used to measure the 
changes in the trade of a country. 


(ix) The index numbers are used to measure seasonal variations and cyclical 
variations in a time series. 


5.7. LIMITATIONS OF INDEX NUMBERS 
(i) They are simply rough indications of the relative changes. 


(ii) The choice of representative commodities may lead to fallacious conclusions as 
they are based on samples. 
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(iii) There may be errors in the choice of base periods or weights etc. 
(iv) Comparisons of changes in variables over long periods are not reliable. 
(v) They may be useful for one purpose but not for the other. 


(vi) They are specialized types of averages and hence are subject to all those 
limitations with which an average suffers from, 


5.8. MAIN STEPS TO CONSTRUCT PRICE INDEX NUMBERS 
The following steps are considered for the construction of price index numbers: 
5.8.1. Object 


The first and the most important step in the construction of index numbers is 
to decide the object of making the index numbers of prices. The prices may be retail 
or whole-sale, The index numbers of retail prices are called the consumer price index 
(CPI) numbers and if the whole-sale prices are taken into consideration, the index 
numbers are called the whole-sale price index numbers. The index numbers of prices 
may be calculated for a certain locality, for a certain class of people like textile 
workers or office clerks etc. The index numbers may be required for geograjfhical 
regions like districts or provinces etc. First of all we decide thé purpose of making 
the index numbers. Once the purpose is decided, then we decide about the scope and 
the area or the people who are to be considered. 


5.8.2. Selection of Commodities 


A list of important commodities is prepared. Those commodities are taken into 
account which are commonly consumed by the consumers. There is no hard and fast 
rule about the number of commodities. Only those commodities are considered on 
which a reasonable amount is spent. The commodities on which the expenditure is 
meager or they are used only occasionally are not included in the list. Thus the 
commodities which are representative of the tastes and customs of the people are 
taken in the list. Dr. Irving Fisher has said that 20 commodities is a small number 
and 50 commodities is a reasonable number. For construction of wholesale price 
index numbers, about 80 commodities are taken in the list and for retail-price index 
numbers about 300 commodities are considered. Sometimes the index numbers of 
very important commodities like wheat, rice, oil, ghee ete. are calculated. These 
index numbers are based on about one dozen commodities and are called sensitive 
price index numbers. 


5.8.3. Collection of Price Data 


The most important and difficult step is the collection of prices, The prices are 
to be taken from the field. For retail price index numbers, retail prices are needed. 
The prices change from place to place and from time to time. On different shops the 
prices are different. In actual practice there are many difficulties. Usually some 
representative shops from where the consumers mostly purchase their items are 
selected and the prices are taken from those shops. The prices are taken on daily 
basis and then the weekly and monthly averages are calculated. Finally quarterly or 
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yearly averages are calculated. Some commodities are sold in different varieties. 
Rice, sugar, mangoes etc. have different varieties which are sold on different prices. 
This problem is solved by assigning due weights to different varieties and then 
weighted average price is calculated. Sometimes different varieties are treated as 
different commodities. 


For whole-sale prices, the prices are taken from the whole-sale markets, 
factories, depots and the whole-sale agencies. The whole-sale prices are usually 
stable, therefore these prices are not taken on daily basis. The price reporting is 
done on weekly or monthly basis depending upon the nature of the commodity. The 
prices of some commodities are controlled by the government. These prices are 
reported whenever some change takes place. 

5.8.4. Selection of Base Period 

The prices of the commodities in the current period are to be compared with the 
prices of some period in the past. This period in the past is called the base period or, 
the reference period, The base period is decided by the experts. For the construction 
of national index numbers, the base period is decided by statistics division, 
Government of Pakistan. This period should not be in the remote past. The period 
which is economically stable and is free of disturbances and strikes is taken as the 
base period. 

There are two methods of selecting the base period. These are fixed base 
method and chain base method. They are discussed below: 


(a) Fixed Base Method 


In fixed base method, a particular year is generally chosen arbitrarily and the 
prices of the subsequent years are expressed as relatives of the prices of the base 
year. Sometimes instead of choosing a single year as the base, a period of a few years 
is chosen and the average price of this period is taken as the base year's price. The 
year which is selected as a base should be a normal year or in other words, the price 
level in this year should neither be abnormally low nor abnormally high. If an 
abnormal year is chosen as the base, the price relatives of the current year 
calculated on its basis would give misleading conclusions. For example, a year in 
which war was at its peak, say the year 1965, is chosen as a base year, the 
comparison of the price level of subsequent years to the prices of 1965 is bound to 
give misleading conclusions, The reason is that the price level in the year 1965 was 
abnormally high. In order to remove this difficulty associated with the selection of a 
normal year, the average price of a few years is sometimes taken as the base price. 
The fixed base method is used by the Government in the calculation of national 
index numbers. 


In Fixed Base, 


: 2 _ Price of current year 
Price relative for current year = Price of base year x 100 


ae P,, = EB x 100 
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Example 5.1. 


Given the prices of rice ( Basmati ) in Multan in rupees per quintal from 1987 
to 1992. Calculate the simple index numbers with 1987 as base. 


1987 1988 1989 1990 1991 


Prices of rice 618.25 741.00 749.00 789.00 927.00 | 1034.00 
(Rs. per quintal ) 


Source: Directorate of Agriculture ( Economics and Marketing ) Punjab, Lahore. 
Solution: 


The necessary calculations are given below: 
Index No, = Bn x 100 
Po 


env mae ( Rs.) 
i (1987 as base ) 


ee 
ae 
POOF ep sale 
eee 
BOO, oy. 10004 
ok ieria 


Example 5.2. 


The following are average whole-sale prices of wheat ( Maxi-Pak ) in Lahore in 
rupees per quintal for the period 1983 to 1992. Compute the index numbers using 
(i) average of first four years as base (ii) average of all the ten years as base. 


[| 
186,25 195.00 200.00]210.19 


Source: Bureau of Statistics Government of the Punjab Lahore. 
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Solution: ; 
The necessary calculations are given below: 
@ ie relatives (ii) Price relatives 
4 Pn 
= 22100 =—x 100 
pane a ( Rs.) =e x => x 
, ( Average of first ( Average of 10 
4 ¥ 4 years as base ) years as base ) 
186.25 186.25 
1983 186.25 ‘197.86 * 100 = 94.13 244.63 * 100 = 76.14 
195.00 40 195.00 
1984 195.00 197.86 * 100= 98.55 244.63 * 100 = 79.71 
200.00 200.00 
1985 200.00 t97.86 * 100= 101.08 244.63 * 100 = 81.76 
210.19 210.19 He 
1986 210.19 197.86 x 100 = 106.23 244.63 x 100 = 85.92 
230.31 230.31 
1987 230.31 197.86 x 100 = 116.40 jet 244.63 x 100 = 94.15 
237.50 237.50 
1988 237.50 197.86 * 100 = 120.03 244.63 * 100 = 97.09 
243.00 243.00 
1989 243.00 197.86 * 100 = 122.81 244.63 * 100 = 99.33 
sais 
287.00 = 287.00 
1990 287.00 197.86 * 100 = 145.05 244.63 x 100 = 117.32 
294.00 2 294.00 
1991 294.00 197.86 * 100 = 148.59 244.63 x 100 = 120.18 
363.00 363.00 
1992 363.00 197.86 x 100 = 183.46 244.63 x 100 = 148.39 
pn = 2446.25 
(i) The average of first four years is 
186.25 + 195.00 + 200.00 + 210.19 791.44 
186.25 + 198.00 + 200.00 #210.19 = —“F- = 197.86 =p, 
(ii) The average of all the ten years is 
ZPy _ 2446.25 
eh 10 = 244.625 or 244.63 = 
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(b) Chain Base Method 


In this method, there is no fixed base period. The year immediately preceding 
the one for which price index have to be calculated is assumed as the base year. 
Thus, for the year 1994 the base year would be 1993, for 1993 it would be 1992, for 
1992 it would be 1991 and so on. In this way there is no fixed base. It goes on 
changing. The chief advantage of this method is that the price relatives of a year can 
be compared with the price level of the immediately preceding year. Businessmen 
are mostly interested in comparison of this type rather than in comparisons relating 
to distant past. Yet another advantage of the chain base method is that it is possible 
to include new items in an index number or to delete old items which are no more 
important. In fixed base method it is not possible. But chain base method has a 
drawback that comparisons cannot be made over a long period. 


In Chain Base, 


¥ Ff _ _Price in the current year 
Link relative of current year = Price in the preceding year x 100 


or in symbols, Pn-tn= Po 100 
™ Pn 
Example 5.3. 


Compute chain indices, taking 1987 as base from the data given in 
Example 5.1. 


Solution: 
The necessary calculations are given below: 


; 4 Link relatives 
Prices of rice 


Years (Rar periquintal) Bais 100 Chain indices 
Pai 
1987 618.25 100, 100 
741.00 J 100 x 119.85 
1988 741.00 618.25 *100 = 119.85 100 = 119,85 


749.00 
741.00 


789.00 


119.85 x 101.08 


* 100 = 101.08 100 


= 121.14 


121.14 x 105,34 


789.00 74900 * 100 = 105.34 Too = 127.61 

927.00 ve 127.61 % 117.49 _ ; 

1991 927.00 759 00% 100.,= 117.49 aa = 149.93 
1034.00 149.93 x 111.54 _ 


1992 1034.00 927.00 * }00= 111.54 100 = 167.23 
ok 2 Sak Se ee EES S| 


212 Basic Statistics Part-I 


5.8.5. Selection of the Suitable Average 

There a lfferent averages which can be used in averaging the price relatives 
or link relati, « of different commodities. Experts have suggested that the geometric 
mean should ve calculated for averaging these relatives. But as the calculation of the 
geometric mean is difficult, it is mostly avoided and the arithmetic mean is 
commonly use i In some cases the median is used to remove the effect of the wild 
observations. 
5.8.6. Selection of: table Weights 

In calculation of price index numbers all commodities are not of equal 
importance. In order to give them their due importance, commodities are given due 
weights. Weights are of two kinds (a) Implicit weights, (b) Explicit weights. In the 
first kind the weigifts are not explicitly assigned to any commodity but the 
commodity to which greater importance is attached is repeated a number of times. A 
number of varieties of such commodities are included in the index number as 
separate items. Thus, if in an index number wheat is to receive a weight of 3 and 
rice a weight of 2, three varieties of wheat and two varieties of rice would be 
included. In this method weights are not apparent, but items are implicitly 
weighted. Such weights are known as implicit weights. In the second kind, weights 
are explicitly assigned to commodities. Only one variety of a commodity is included 
in the construction of index number but its price relative is multiplied by the figure 
of weight assigned to iv. Explicit weights are decided on some logical basis. For 
example, if wheat and rice are to be weighted in accordance with the value of their 
net output and if the ratio of their net output is 5 : 2, wheat would receive a weight 
of five and rice two. Such weights are called explicit weights. Sometimes the 
quantities which are consumed are used as weights. These are called quantity 
weights. The amount spent on different commodities can also be used as their 
weights. These are called the value weights. 
5.9. UNWEIGHTED INDEX NUMBERS 

There are two methods of constructing unweighted index numbers: 
5.9.1. Simple Aggregative Method 

It is obtained by dividing the sum of the prices of current year by the sum of 
the prices of base year and expressed as percentage i.c., 

P= =" 100 

Where, Zp, represents the sum of prices for the current year, and Lp, 
represents the sum of prices for the base year. 
5.9.2, Simple Average of Relatives 

In simple average of relatives we find price relatives of the given commodities 
and then average them. Any one of the average, mean, median and geometric mean 
may be used. If we use arithmetic mean, we have 


1 
Pon = x2(2) x 100 


Where, N represents the number of commodities. 
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Example 5.4. 


The following are the prices of four different commodities for 1990 and 1991. 
Compute a price index by (i) Simple aggregative method and (ii) Average of price 
relatives method by using both arithmetic mean and geometric mean, taking 1990 as 


base. 
(American ) | (Maxi-Pak 


Fee 

(Rs. per quintal ) 

Ser | 
(Rs. per quintal ) 

Source: Bureau of Statistics Government of the Punjab Lahore. 


Solution: 


The necessary calculations are given below: 


Price in 1990 Price in 1991 Price relatives 
Commodity | ( Rs. per quintal) | (Rs. per quintal) p= Pn, 100 log P 
te Po Pa Po 
874 =I 
Kappas 909 874 G09 * 100 = 96.15 1.9829 
' 305 
Wheat. 288 3805 288 * 100 = 105,90 2.0249 
‘ 910 
Rice 767 910 767 * 100 = 118.64 2.0742 
573 “ 
Gram 659 573 659 * 100 = 86.95 1.9393 
Pn 
=z (2) * 100 = 407.64 


(i) Simple Aggregative Method: 


2Pn _ 2662 4 
Zp, 100 = 353% 100 = 101.49 


(i) Average of Price Relatives Method ( using arithmetic mean ): 
2) bof ha eg as 
Pa= Wz (2) *100 = 4 (407.64) = 101.91 
Average of Price Relatives Method ( using geometric mean y 


Py = Antilog (ee?) = Antilog (20218) = 101.28 


Pon = 


4 


a 
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Example 5.5. 


The following table gives the average whole-sale prices in rupees per quintal of 
Kappas ( American ), Wheat ( Maxi-Pak ) and Rice ( Basmati ), during the year 1987 
to 1991. Construct index numbers with 1987 as base using: (i) Arithmetic Mean 
(ii) Median as an average. 


Commodities ( Prices in rupees per quintal ) 


y Kappas Wheat 
pa (American ) | (Maxi-Pak) | (Basmati) 


620.00 


618.00 
719.00 
909.00 
874.00 


Source:- Bureau of Statistics Government of the Punjab Lahore. 
Solution: 


The necessary calculations are given below: 


Price Relatives Index Numbers (1987 as base) 


Kappas Wheat 
(American ) ( Maxi-Pak ) 


618.00 
620.00 * 100 
= 99.68 


E 360.27 g 
767.00 
616.25 “100 | 412.20 | 137.40 
= 124.46 
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6. 


The following table gives the prices of two commodities, Gold Tezabi and Silver 
Tezabi, during the year 1987 — 88 to 1991 - 92. Compute chain index numbers using 
1987 — 88 price as base. 


1987 


1988 


- 88 
— 89 


Gold Tezabi 
(per 10 grams ) 


8045 
2892 


Commodities Prices in Rupees 


Silver Tezabi 


(per kilogram ) 


1989 - 90 2972 5193 
1990-91 3080 4758 
1991-92 3140 4342 
cau 
Source: Federal Bureau of Statistics 
Solution: 
The necessary calculations are given below: 
Link relatives 
Year Gold Tezabi Silver Tezabi Total Mean Chain Indices 
1987-88 100 100 200 | 100 100 
‘Saen  eee eee ee = 
2892 5333 100 x 101.14 
1988-89] 3045 * 100 4970 “100 | 292.98] 101.14 100 
= 94.98 = 107.30 = 101.14 
2972 99 5193 101,14 x 100.07 
1989-90] 2892 5333 *200 | 999.14] 100.07 100 
= 102,77 = 97.37 = 101.21 
3080 4758 101.21 x 97.68 
1s90-91| 2972%1200 | 593% 100 | 95051 97.63 100 
= 103.63 = 91.62 = 98.81 
j}—_+ ial a gE 
3140 , 59 4342 98.81 x 96.61 
1991-92] 3080 4758 °° 100 | 193.01] 96.61 100 
= 91.26 = 95.46 


He. = 101.95 
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Example 5.7. 


Find chain indices from the following price relatives of three commodities, 
using geometric mean as average. 


Commodities 


Solution: 


The necessary calculations are given below: 


262.93 
100 
= 226.17 
226.17 x 132.83 
100 
= 300.42 
100 
= 307.24 
307.24 x 73.45 
100 
= 225.67 


5.10, WEIGHTED INDEX NUMBERS 

There are two methods of calculating weighted index numbers. 
5.10.1. Weighted Aggregative Price index Numbers 

In this method we calculate the total expenditure of the current year and the 
base year. Price of cach commodity is niultiplied with the weight of the commodity 
which is usually the quantity consumed or quantity produced. The quantity of the 
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base year (q,) or current year (q,) can be used as weight. The current period 
expenditure is compared with the base period expenditure. The formula used is 


Current year expenditure x 100 


Price Index Base year expenditure 


There are various kinds of weighted aggregative index numbers, some 
important formulas are discussed below: 


=Pn do 
ZPo do 


(i) Laspeyre's Index: P,,= * 100 


This is also called base year weighting system because the base year quantities 
q. are used as weights. 
2Pn Gn 
wo 
Zpo dn 


This is also called current year weighting system because current year 
quantities q,, are used as weights. 


(iii) Fisher's Ideal Index: 
It is the geometric mean of the Laspeyre's and Paasche's index. 


(ii) Paasche's Index: Pon = 100 


PB. RS ~/Laspeyre's Index x Paasche's Index 


ZPndo 2Pndn ZPndo  =Pndn 
* 100. *100 = x x 100 
ses ZPodn ZPodo  =Podn 


This index lies between Laspeyre's and Paasche's indices. 
5.10.2. Weighted Average of Relatives Price Index Numbers 


In this method the unweighted index number is converted into weighted index 
number. The weights used are values. These values are estimated on the basis of the 
aggregate expenditure in the base year. Weights are in proportion of these values. 
The aggregate expenditure of a commodity in the base year is calculated by 
multiplying quantity with price. The index number for the current year is calculated 
by dividing the sum of the products of the current years price relatives and base 
years values by the total of the weights. Symbolically 


Weighted Average of Relatives = 2a 
2W 
Pn 
where I = pan 100 and W = p,q 
0 


Example 5.8. 
Construct index numbers of prices from the following data by applying: 


(i) Laspeyre's method (ii) Paasche's method and (iii) Fisher's Ideal method 
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Commodities 


Solution: 


Base year 


Current year 


Price Quantity 


The necessary calculations are given below: 


@) — Laspeyre's method: 


(ii) Paasche's method: 


(ii) Fisher's Ideal method: Py, 


Example 5.9. 


=pigo = 


Lpoqi = 
Compute: 


Po 


wb 

S 

o 
il) 


U 
e 
oS 
i=) 

iT] 


Po = 


ZP140 
=Podo 


Epo 


139.88 


ui 1880 
a /7o00 , 1880 * 100 = 139.79 


Given the following information; 


4220, — Xpogo = 3520,  Epeqo 
4020, Lpzqz = 6896, Lpoge 
(i) Laspeyre's index aumbers 
(ii) Paasche's index iiumbers 
(iii) Fisher's Ideal indes numbers 


i 


5460, 
4462. 


Xpiqi = 4810, 
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Solution: 
The necessary calculations are given below: 
(i) Laspeyre's Price Index Numbers: 
EP ido 4220 


Por = FpgeX 100 = g599% 100 = 119.89 

ee sot x 100 is 5460 .. 100 = 155.11 
(ii) Paasche's Price Index Numbers: 

pais Fam « 100 = Bx 100 = 119.65 

ie Fil x 100 = $898 100 = 154.55 


(iii) Fisher's Ideal Price Index Numbers: 


Epido , ZPid1 4220 4810 
eth SN anon 100 = 
Epo * Epoar * °° 3520 * 4020 * 100 = 119.77 


ZP2do , LP2d2 [5460 6896 
Sel aaa oe 3520 * 4469 * 100 = 154.83 
Example 5.10. 

The following data relate to the prices and quantities of three commodities in 
the years 2002 and 2003. Construct the following index numbers of price for the year 
2003 by taking 2002 as the base year. 

(i) Base year weighted (ii) Current year weighted. 


Prices ( Rs. per kg.) Quantities ( kgs.) 
2002 2008 2002 2003 


A 6 10 50 60 
B 2 2 100 120 
Cc 4 6 60 60 


Solution: 
The necessary calculations are given below: 


[conmodiion |. [mo [oe [a [rm [rm [mm | ma] 
A 6 10 50 
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(i) Base year weighted index for 2003, 
=p1do 00 1060 


Po.= Epo x1 = “749 * 100 = 143.24 
oo 
(ii) Current year weighted index for 2003, 
D>} 
Pa= Foltx 100 = 200100 = 142.86 
ot 


5.11. CONSUMER PRICE INDEX NUMBERS 

A consumer price index number is an index number which is made to measure 
the relative change in purchasing a specified basket of goods and services between 
two periods for a certain locality for fixed income group of people. Consumer price 
index numbers are also called as cost of living index numbers or retail price index 
numbers. The basket of goods and services will include items like: 

(i) food (ii) fuel and lighting (iii) house rent (iv) clothing (v) miscellaneous. 
5.12. MAIN STEPS TO CONSTRUCT THE CONSUMER PRICE INDEX 
NUMBERS 

The main steps involved in the construction of consumer price index numbers 
or cost of living index numbers are given below: 
5.12.1. Scope 

The first and most important step for constructing a consumer price index 
number is that we should know the scope of our index number. We should specify 
the area and locality for which index number is being made and also the income 
group of the population living in that locality. For example industrial workers, clerks 
or low salaried workers etc. 
5.12.2. Family Budget Enquiry and Allocation of Weights 

The next step is to: conduct a family budget enquiry, The enquiry should 
include questions about quality and quantity of goods and services consumed and 
amount of money spent on them under various items. For example food, fuel and 
lighting, clothing, house rent and miscellaneous. The miscellancous group includes 
items like amusement, education, medicine, gifts, newspaper, transport, barber etc. 
5.12.3, Collection of Price Data 

The next step is to collect price data for the basket of goods and services for two 
periods i.e. base year and current year. The price quotations should be taken from 
those shops which are situated in that locality for which index number is being 
constructed. 
5.12.4. Computation of Consumer Price Index Numbers 

There are two methods to compute consumer price index numbers. 
(1)! AGGREGATE EXPENDITURE METHOD 


In this method, the quantities of commodities consumed by the particular 
group in the base year are estimated and these figures or their proportions are used 
as weights. Then the total exnenJditur each commodity for each year is 


[Chapter 5] Index Numbers : 221 


calculated. The price of the current year is multiplied by the quantity or weight of 
the base year. These products are added. Similarly for the base year total 
expenditure on each commodity is calculated by multiplying the quantity consumed 
by its price in the base year. These products are also added. The total expenditure of 
the current year is divided by the total expenditure of the base year and the 
resulting figure is multiplied by 100 to get the required index numbers. In this 
method, the current period quantities are not used as weights because these 
quantities change from year to year. 


= 2PnMe . 100 


on 2P odo 
where, p, represent the prices of the current year, 
p, represent the prices of the base year and 
q, represent the quantities consumed in the base year. 
(2) FAMILY BUDGET METHOD 
In this method, the family budget of a large number of people are carefully 
studied and the aggregate expenditure of the average family on various items is 
estimated. These values are used as weights. Current year's prices are converted 
into price relatives on the basis of base year's prices and these price relatives are 
multiplied by the respective values of the commodities, in the base year. The total of 
these products is divided by the sum of the weights and the resulting figure is the 
required index numbers. 


=WI Pn 


Po = Sy Where I mee 100 and W = pq 


This can be written as 


Pn 
ise! (2). 100( Po do) Bn de, 199 
a EPo do ~  2Po do 

This formula is the same thing as used above in aggregate expenditure method, 
Thus both the methods i.e., the aggregate expenditure method and family budget 
method will give the same answer. 
Example 5.11. 

Compute the cost of living index number by using weighted average of price 
relatives method. 


Prices in rupees 


13 8 6 


222. 
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Solution: 


The necessary calculations are given below: 


Base year 7 
price in Rs. 
Po 


Current year 
price in Rs. 
Pi 


Weights 


Items (wy 


8 
13 * 100 = 61.54 


22 
B 5 15 * 100 = 146.67 
185 
ic 4 349 * 100 = 74.30 
250 : 
D 1 39g * 100 = 76.22 


ang Linil bod 
497 * 100 = 90.14 


WI 


733.35 


297.20 


76.22 


=WI 
= 1656.29 


Cost of living index number (by weighted average of price relatives method) 


WI _ 1656.29 
=w 4B 


Example 5.12, 


= 92.02 


An enquiry into the budgets of middle class families in a city gave the following 


information. 


Rent 
15% 


Fuel 
10% 


Clothing 
20% 


Expenses on 


Prices in 2000 ( Rs.) 


Prices in 2001 ( Rs.) 


Education 
25 % 


What is the change in the cost of living ‘igure in 2001 as compared with 2000? 
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Solution: 
The necessary calculations are given below: 
Prices 
2 Weight Product 
vee 000 | 2001 eig. 2 Ba roduc’ 
Commodities Pe 1 Ww I= Po x 100 WI 
3600 
Food 3000 | 3600 30 aq09 * 100 = 120.00 3600.0 
3000 
-——_——_— Sis 
600 
Rent 600 600 15 600 * 100 = 100.00 1500.0 
_ 
‘ 2400 
Clothing 2100 | 2400 20 2100 * 100= 114.29 2285.8 
|Z eal oe ae = 
Fuel 600 | 600 | 10 | $88 109 = 100.00 1000.0 
: 1650 
Education 1200 | 1650 25 1200 * 100 = 137.50 3437.5 
ZW = 100 + ELWI = 11823.3 


z 11823. 

Cost of living index number for 2001 = mm sige = 118.233 or 118.2. 
Since the cost of living index number for 2001 is greater than 100, we therefore 
conclude that prices in 2001 have increased 18.2 % as compared with the prices in 
2000. 

Example 5.13. 


Construct the consumer price index number for 2008 on the basis of 2002 from 
the following data using: (i) Aggregate expenditure method and (ii) Family budget 
method, 


Prices 


Quantity Consumed in 2002 | Unit of price 2002 2003 


6 quintal per quintal 


[aint | perguntat | 0 | 
4 kilogram per kilogram | 12.00 


1020.00 | 1015.00 


Commodities 


per quintal 
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Solution: 
(i) Consumer price index number of 2003 here Aggregate expenditure method: 


Quantity 


[ Prices | 
consumed 
in 2002 


price 
qo 
Een IE neeeN 
sana a cata 
ee er rc 
5 in rte zon a0 


1 quintal per quintal | 1020.00 | 1015.00 


Commodities 


Consumer price index number of 2003 is: 
2P1 4 ,. 100 8884 . 100 


= 106.06 


One peda 8376.5 
(i) Consumer price index number of 2003 by Family budget method: 


mee Sea aes 
[= [rea [aso | so 


=IWIl= 
888393.56 


Consumer price index number of 2003 is: 


=WI 888393.56 
Po = Tw = 93765 7 106.06 
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Example 5.14. 


Calculate the price index number for 1994 on the basis of 1984 from the 
following data using aggregate expenditure method. 


Commodities Prices (Rs. per kg.) | Prices ( Rs. per kg.) Values 
1984 1994 


Solution: 
Values mean the expenditure on different commodities. Thus values = Podo- 


The necessary calculations are given below: 


rb — 
Prices ( Rs.) | Prices ( Rs.) 
Commodities 1984 1994 ae Podo 
Po Pi Poo * Pe Pido 
i Ele lh 
100 
Wheat 2.50 4.50 100 2.507 40 180 
; 180 
Rice 9.00 16.00 180 9.00 > 20 320 
Sugar 8.00 16.00 80 80 19 160 
Ag: A A 8.00 = 
Ghee 12.00 40.00 60 200 
Epodo = 420 Zpid. = 860 


Price index by Laspeyre's formula: 


Zid 


Pou = Spsqo 


860 
*100 = 420 * 100 = 204.76. 


Note: From the given data only the Laspeyre's price index can be calculated. 
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SHORT DEFINITIONS 
\Andex Number 
An index number is a measure of change in magnitude relative to a fixed base value. 
or 


An index number is a ratio used to measure relative change between two time 
periods. 
Simple Index Number ~~ 
A simple index number is based on the relative changes in the price or quantity of a 
single commodity. or 
A pr index measures the relative change from the base period for a single 
commodity. 
omposite Index Number 
A composite index number represents combinations of the prices or quantities of 
several commodities. or 
A composite index ae measures relative change from the base period for a 
group of items. 
Price Index Number 
An index that measures a change in prices paid or received by consumers or 
producers is called a price index number. or 
An index number constructed to measure the change in the price of an item or a 
group of items is called a price index number. 
(Quantity Index Number 
An index that is designed to measure changes in quantities with respect to time. or 
An index number constructed to measure a change in the physical quantity of goods 
produced, consumed or sold of an item or a group of items is called quantity index 
number. 
\-Price Relative 
The price relative is defined as the price in the current period divided by the price in 


the base period multiplied by 100 i.e; Pon = S x 100. 


\\ Fixed Base Method 
In fixed base method, one of the time periods is chosen as the base and rest of the 
values of the various time periods are divided by the base period value and the 
results are expressed in percentage form. 
ink Relative ’ 
A link relative is obtained by dividing the current period price by the immediately 


“x 100. 


preceding period price into 100 i.e; Pa_i,n= = 


Chain Base Method 

Chain base method is used to convert the link relatives into fixed base method. The 
process of changing the link relatives into fixed base is known as chaining process. 
The average of the link relatives for the beginning year is taken as chain index for 


t ‘7 sith Unde 
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that year. This chain index is multiplied by the average of the link relatives of the 
next year and divided by 100 and so on. The values obtained in this way are known 
as the chain indices. 
Unweighted Index Number 
It is a device that measures the relative charge in a group of variables after ignoring 
the relative importance of the variables. 
Simple Aggregative Price Index Number 
A simple aggregative price index number is the ratio of a sum of prices in current 
period to the sum of prices in the base period, expressed as a percentage i.e; 
Pon = es 100. or 
A simple aggregative price index number is obtained by dividing the sum of current 
year prices by the sum of base year prices and multiplying by 100. 

\-Weighted Index Number 
An index that measures the relative change in a group of variables keeping in view 
the relative importance of the variables. 
Weighted Aggregative Price Index Number 
A weighted aggregative price index is the ratio of an aggregate of weighted 
commodity prices for a given year to an aggregate of the weighted prices of the same 
commodities in some base year, expressed as a percentage. or 
A weighted aggregative price index number is the ratio of a weighted sum of prices 
in current year to a weighted sum of prices in the base year, expressed as a 
percentage. 
Laspeyre’s Price Index Number 
The Laspeyre’s price index number is the ratio of the total cost in the given year of 
the quantity of each commodity consumed in the base year to the total cost of these 
quantities in the base year, expressed as a percentage. In symbols, the Laspeyre’s 
index is( Pon = Pc 100. 5 
Paasche’s Price Index Number 


The Paasche’s price index number is the ratio of the total cost in the given year of 
the quantity of each commodity consumed in the given year to the total cost of these 
quantities in the base year, expressed as a percentage. In symbols, the Paasche’s 


index in{Po = ie 109 


\ €onsumer Price Index Number or Cost of Living Index Number 
A cost of living index number is an index number which is designed to measure the 
relative change in purchasing a specified basket of goods and services between two 
periods for a certain locality for fixed income group of people. or 

E-& monthly price index that is used to measure the change in prices of basket of 
goods and services is called consumer price index number, 
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MULTIPLE CHOICE QUESTIONS 


1. An index number is called a simple index when it is computed from: 


(a) single variable (b)  bi-variable 
(c) multiple variables (d) none of them 
2. Index numbers are expressed in: 
(a) ratios (b) squares 
(c) percentages (d) combinations 
3. If all the values are of equal importance, the index numbers are called: 
(a) weighted (b) un-weighted 
(c) composite (d) value index 
4. Index numbers can be used for: 
(a) forecasting (b) fixed prices 
(c) different prices (a) constant prices 
5. Index for base period is always taken as: 
(a) 100 (b) one 
(c) 200 (d) zero 
6, When the prices of rice are to be compared, we compute: 
(a) volume index (b) value index 
(c) price index (d) aggregative index 
7. When index number is calculated for several variables, it is called: 
(a) composite index (b) whole sale price index 
(c) volume index (d) simple index 
8. How many types are used for the calculation of index numbers: 
(a) 2 (b) 3 
(©) 4 @ 5 
9. In chain base method, the base period is: 
(a) fixed (b) not fixed 
(c) constant (d) zero 
10. Which formula is used in chain indices? 
(a) 22x 100 ) —x 100 
=Po Pai 
nt OD LPnIn 
(©) =a 100 (d) Spay x 100 
11. Price relatives is a percentage ratio of current year price and: 
(a) base year quantity (b) previous year quantity 
(c) base year price (d) current year quantity 
12. Indices calculated by the chain base method are free from: 
(a) seasonal variations (b) errors 
(©) percentages (d) ratios 
13. The chain base indices are not suitable for: 
(a) long range comparisons (b)  short.range comparisons 


(c) middle range comparisons (d) all of the above 
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14, An index number that can serve many purposes is called: 
(a) general purpose index (b) special purpose index 
(c) cost of living index (d) none of them 

15. Another name of consumer's price index number is: 
(a) whole sale price index number (b) cost of living index 


(c) sensitive (da) composite 
16. Consumer price index indicates: 
(a) rise (b) fall 
(c) both (a) and (b) (d) neither (a) and (b) 
17. Purchasing power of money can be assessed through: 
(a) simple index (b)  Fisher’s Ideal index 
(c) consumer price index (d) volume index 
18. Cost. of living at two different cities can be compared with the help of: 
(a) value index (b) consumer price index 
(c) volume index (d) un-weighted index 
19. Consumer price index numbers are obtained by: 
(a) Laspeyre's formula (b) Fisher ideal formula 


(c) Marshall Edgeworth formula (d) Paasche's formula 
20. Laspeyre’s index = 110, Paasche's index = 108, then Fisher’s Ideal index is 


equal to: 
(a) 110 (b) 108 
(c) 100 (@ 109 
21. Most commonly used index number is: 
(a) volume index number (b) value index number 
(c) price index number (d) simple index number 
22, For consumer price index, price quotations are collected from: 
(a) fair price shops (b) government depots 
(c) retailers (d) whole-sale dealers 
23. Price relatives computed by chain base method are called: 
(a) price relatives (b) chain indices 
(c) link relatives (d) none of them 
24. Consumer price index are obtained by: 
(a) Paasche's formula (b) Fisher's ideal formula 


(c) Marshall Edgeworth formula (d) Family budget method formula 
25. The aggregative expenditure method and family budget method always give: 


(a) different results (b) ‘approximate results 
(c) same results (d) none of them 
26. In fixed base method, the base period should be: 
(a) for away (b) abnormal 
(c) unreliable (d) normal 
27, If all the values are not of equal importance the index number is called: 
(a) simple (b) unweighted 


(c) weighted (d) none 
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28. 


29. 


30. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


88. 


39. 


40. 
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Which of the following formula satisfy the time reversal test: 


=P1do _ =p 
(a) Pu Spade b) Po = Epoar 


[pia Epi 
(c) Mee Spodo Spank (d) none of them 


When the price of a year is divided by the price of a particular year we get: 
(a) simple relative (b) link relative 

(c) aandb, both (d) none of them 

When the price of a year is divided by the price of the preceding year, we get: 
(a) valueindex (b) link relative (c) simple relative (a) none of them 
The most appropriate average in averaging the price relatives is: 


(a) median (b) harmonic mean 

(c) arithmetic mean (d) geometric mean 

In constructing index number geometric mean relatives are: 

(a) non-reversible (b) reciprocal 

(c) reversible (d) none of them 

The general purchasing power of the currency of a country is determined by: 
(a) retail price index (b) volume index 

(c) composite index (d) whole-sale price index 


What type of index number can help the government to formulate its price 
policies and to take appropriate economic measures to control prices: 


(a) whole sale price index (b) consumer's price 
(c) quantity (d) none of them 

The most suitable average in chain base method is: 

(a) arithmetic mean (b) median 

(c) mode (d) geometric mean 
Base year quantities as weights are used in: 

(a) Laspeyre’s method (b) Paasche’s method 
(c) Fisher's ideal method (d) difficult to tell 
Chain process is used to make comparisons of price index numbers in: 
(a) price relative (b) link relative 

(c) simple relative (d) none of the above 


In the computation of consumer price index numbers, we use: 
(a) aggregate expenditure method (b) family budget method 


(c) chain base method (d) both (a) and (b) 
The Federal Bureau of Statistics prepares: 
(a) the wholesale price index (b) the consumer price index 


(c) the sensitive price indicator  (d) alll of the above 

While computing a weighted index, the current period quantities are used in the: 
(a) Laspeyre’s method (b) Paasche’s method 

(c) Marshall Edgeworth method (d) Fisher's ideal method 


[Chapter 5] Index Numbers 231 


41, The best method to measure the relative change in prices of commodities is: 


42. 


43. 


45. 


46. 


47. 


48. 


49, 


50. 


(a) quantity index number (b) value index number 

(c) volume index number (d) price index number 

When the base year values are used as weights, the weighted average of 
relatives price index number is the same as the: 

(a) Laspeyre’s index (b) Paasche’s index 

(c) simple aggregative index (d) quantity index 

To measure the relative change in purchasing a specified basket of goods and 
services between two periods for a certain locality for fixed income group of 
people, we can use: 

(a) consumer price index (b) Paasche’s price index 

(c) cost of living index (a) both (a) and (c) 

Fisher's ideal index number is the geometric mean of the: 

(a) Laspeyre’s and Marshall Edgeworth indices 

(b) Laspeyre’s and Paasche’s indices 

(ce) Paasche’s and Marshall Edgeworth indices 

(d) all of the above 

A number that measures a relative change in a single variable with respect to 


_a base is called: 


(a) good index number (b) composite index number 

(c) simple index number (d) quantity index number 

A number that measures an average relative change in a group of related 
variables with respect to a base is called: 

(a) simple index number (b) composite index number 

(c) price index number (d) quantity index number 

An index number constructed to measure the relative change in the price of an 
item or a group of items is called: 


(a) quantity index number (b) price index number 
(c) volume index number (d) difficult to tell 

When relative change is measured for a fixed period, it is called: 
(a) chain base method (b) fixed base method 


(c) simple aggregative method (d) cost of living method 

The ratio of a sum of prices in current period to the sum of prices in the base 
period, expressed as a percentage is called: 

(a) simple price index number 

(b) simple aggregative price index number 

(c) weighted aggregative price index number 

(d) quantity index number 

An index that measures the average relative change in a group of variables 
keeping in view the relative imvortance of the variables is called: 

(a) simple index number (b) composite index number 

(c) weighted index number (d) price index number 
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51. Link relative of current year is equsl to: 


(a) price of current year x 100 6) price of base year x 100 


price of base year price in the preceding year 


© price in the current year x100 @ price in the preceding year x 100 


price in the preceding year price in the current year 
52. Simple average of relatives is equal to: 


Ba Epa 

(a) of x 100 (b) po x 100 

@) = (=) x 100 (d) aE kel x 100 
53. Paasche’s price index number is also called: 

(a) base year weighted (b) current year weighted 

(c) simple aggregative index (a) consumer price index 
54. Laspeyre’s price index number is also called: 

(a) base year weighted (b) current year weighted 

(c) cost of living index (ad) simple ageregative index 
55. Index number having downward bias is: 

(a) Laspeyre’s index (b) Paasche’s index 

(c) Fisher's ideal index (ad) Marshall Edgeworth index 
56. Index number having upward bias is: 

(a) Laspeyre’s index (b) Paasche’s index 

(c)  Fisher’s ideal index (ad) Marshall Edgeworth index 
57. Marshall Edgeworth price index was proposed by: 

(a) one English economist (b) two English economist 

(c) three English economist (d) many English economist 
58. Index number calculated by Fisher's formula is ideal because it satisfy: 

(a) circular test (b) factor reversal test 

(c) time reversal test (d) all of the above 


59. The test which is not obeyed by any of the weighted index numbers unless the 
weights are constant: 
(a) circular test (b) time reversal test 
(c) factor reversal test (d) none of them 


{1. @]2. ©] 3. (b) | 4. (a) | 5. a (a) | 
9. (b) | 10. (6) | 11. (©) | 12. (a) [ 13. (a (c)_ 
17. (c) | 18. (6) [19. (a) | 20. _@) [21. ©) (a) 
25. (c) | 26. (d) | 27. (©) | 28. © | 29. (a (©) | 

| 33. (d) | 84. (6) | 35.) | 36. (a) | 37. ome (b) | 

/[41, d) | 42. (a) 143. @ | 44.) | 45. ©) i 2 .__(b) 

} 49. (b) | 50. (c) | 51. (©) | 52. ) 153. (b) 154. (a) | 55. (a) | 56. (b) 

57. (b) | 58. (@) | 59. (a) 
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Q.9 

Q.10 
Qi 
Q.12 
Q.13 
Q.14 
Q.15 
Q.16 
Q.17 


Q.18 
Q.19 
Q.20 


Q.21 


Q.22 
Q.23 
Q.24 
Q.25 
Q.26 
Q.27 
Q.28 
Q.29 
Q.30 
Q.31 
Q.32 


Q.33 
Q.34 
Q.35 
Q.36. 
Q.37 
Q.38 


SHORT QUESTIONS 
Define an index number. 
Define price index numbers. 
Define simple index number, 
Define composite index number. 
Define weighted index number. 
Define un-weighted index number. 
Define price relative. 
Define link relative. 
Why index numbers are called economics barometer? 
Differentiate between simple and composite index numbers. 
Differentiate between price relatives and link relatives. 
Describe the different types of index numbers. 
What are the important uses of index numbers? 
Explain the fixed base method. 
Describe the chain base method. ¥ 
Distinguish between fixed base and chain base methods. 
Write down the main steps involved in the construction of price index 
numbers. 
Distinguish between weighted and un-weighted index numbers. 
Write down the main limitations of index numbers. 
What are the various ways of assigning weights in the construction of index 
numbers? 
Define index numbers. Write down only main steps which are used for the 
construction of index numbers of prices. 
Define simple aggregative price index number, 
Name the sources of index numbers. 
Define the base period or reference period. 
Differentiate between implicit weights and explicit weights. 
Define Laspeyre's price index number or base year weighted. 
Define Paasche's price index number or current year weighted. 
Write down the methods of calculating weighted index numbers, 
Define weighted aggregative price index number. 
What are the two methods of constructing un-weighted index numbers? 
Explain the meaning of consumer price index numbers. 
Explain the difference between wholesale price and consumer price index 
numbers, 
What is cost of living index number? 
Write down the methods to compute consumer price index numbers. 
Write down the main steps to construct the consumer price index numbers, 
What is CPI? How is it calculated? 
Write short note on quantity index number. 
What are the different types of index numbers? 


234 


Q.39 
Q.40 


Q.41 
Q.42 
Q.43 
Q.44 
Q.45 
Q.46 
Q.47 
Q.48 


Q.49 
Q.50 
Q.51 
Q.52 
Q.53 


Q.54 


Ans, 
*Q.55 
Ans. 


Q.56 
Ans. 


Q.57 
Ans. 


Q.58 


Ans, 
Q.59 


Ans. 
Q.50 
Ans. 
Q.61 
Ans. 


Q.62 


Ans. 
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Explain a market basket. 

What is the relationship between Laspeyre’s price index number, Paasche’s 
price index number and Fisher's Ideal price index number? 

How we find weighted aggregative price index numbers? 

How we compute link relatives in chain base method? 

How we find a value index number? 

How we find simple aggregative price index number? 

How we find Fisher's Ideal price index number? 

How we calculate price relatives in fixed base method? 

How we calculate un-weighted index numbers? 

How many methods are used for selection of base period? Name these 
methods. i 

How we calculate consumer price index numbers? 

How we compute weighted index numbers? 

How we compute base year weighted index number of prices? 

How we calculate current year weighted index for prices? 

Write down the advantages of chain base method. 

Given py = 16, 15, 12, 20, 22 and p, = 20, 12, 15, 22, 22. 


Pp 
Find I where I = = x 100, 
0 


125, 80, 125, 110, 100 

Given p, = 6, 2, 4 and q, = 50, 100, 60. Find SW. 

740 

Given po = 2, 4, 6, qo = 10, 12, 14 and I = 150, 125, 140. Compute SWI 

20760 

Given W = 100, 180, 80, 60 and po = 2, 9, 8, 12. Find qo. 

50, 20, 10, 5. 

Given Xp, = 2550 and Ep, = 2590. Find price index number by using simple 
aggregative method. 

101.57 

Given Lpo = 660, Ypi = 924 and Yp2 = 1056. Compute simple aggregative 
price index number. 

Po: = 140, Pou = 160 

if link relatives ave 100, 120, 102, 105, 118 and 112. Find chain indices. 

100, 120, 122.4, 128.52, 151.65, 169.85 

Given Eppq, = 850 and Zp,qo = 1170. Fine Laspeyre's price index number. 
137.65 

Given Epigo = 1050, Lp2qo = 1120 and Ypogo = 850 Compute base year 
weighted index. 
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Q.63 
Ans. 


Q.64 


Ans. 
Q.65 


Ans. 
Q.66 


Ans. 


Q.67 
Ans. 
Q.68 
Ans. 
Q.69 


Ans. 
Q.70 


Ans. 
Q.71 


Ans. 
Q.72 


Ans. 
Q.73 


Ans, 
Q.74 


Ans, 
Q.75 


Ans, 
Q.76 


Ans. 


Given Ep oq, = 950 and Ep,q, = 1310. Find Paasche's price index number. 
137.89 

Given Ypigi = 1400, Lpzqo = 1600, Lpoqi = 1360 and Lpo gz = 1560. Compute 
current year weighted index numbers. 

Poi = 102.94, Pos = 102.56 

Given Laspeyre's price index number = 120 and Paasche's price index 
number = 119.6. Find Fisher's Ideal price index number. 

119.8 

Given Epoq) = 3600, Ep,qy = 4300, Zpyq, = 4100 and Yp,q, = 4890. Find 
Fisher's Ideal price index number. 

119.36 

Given Eppqy = 1500 and Ep,qy = 2040. Find base year weighted index. 

136 

Given Zpoq, = 1000 and p,q, = 1360. Find current year weighted index. 

136 

Given Fisher's Ideal price index number = 104.3 and Paasche’s price index 
number = 103.2. Find Laspeyre’s price index number. 

105.4 

Given Laspeyre’s price index number = 110 and Fisher’s Ideal price index 
number = 111. Find Paasche’s price index number. 

112 

Given IW = 20 and EWI = 1800. Find cost of living index number by weighted 
average of price relatives method. 

90 

Given ZW = Lp_qy = 8500 and EWI = 892500. Construct consumer price index 
number by family budget method. 

105 

Given W = 45, 15, 12, 8, 20 and SWI = 10500. Compute cost of living index 
number. 

105 

Given W = 20, 25, 30, 40 and I = 100, 105, 110, 120. Find consumer price 
index number. 

110.65 

Given Zp,q9 = 9000 and Zpyq) = 8490. Find and interpret consumer price 
index number by aggregative expenditure method, 

106 


, sip Pa 
Given N( number of commodities )=5 and = ( 
0 


} * 100 = 510, Find price 


index by average of price relatives method. 
102 
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EXERCISES 


Q.1 Find price relatives for the data given below, using (i) 1992 as base (ii) average 
of last 3 years as base: 


Ans. (i) 100, 120, 136, 160, 200, 180, 220, 300, 280 
(ii) 87.50, 45.00, 51.01, 60.01, 75.01, 67.51, 82.51, 112.51, 105.01 


Q.2 Kind index numbers of prices from the following data using the average of last 
five years as base, 


Prices 


Ans. 
Q.3 


year from the following average wholesale 
prices of cotton in rupees per bale of 10 lbs. for the period 1980 to 1990 taking 


1980 = 100. 


Years 1980 | 1981 | 1982 | 1983 | 1984 | 1985 | 1986 1987 | 1988 | 1989 4990 | 

[Prices (Rs) 65 72 is 76 80 85 82 88 88 90 95 

Ans. 100, 110.77, 115.38, 116.92, 123.08, 130.77, 126.15, 135.38, 185.38, 138.46, 
146.15 

Q.4 Table given below shows a country's average wholesale prices of wheat for 
various years, Find the price relatives for (i) the year 1958 using 1948 as base 
(ii) the years 1949 and 1956 using 1950 as base (iii) the years 1955 — 1958 
using 1947 — 1949 = 100. 


Ans.(i) 89.2 (ii) 97.82 104.37 Gui) 101.21, 96.76, 95.14, 90.28 

Q.5 The average retail prices in dollars per ton of wheat in a certain country during 
the years 1953 ~ 1958 are given. (i) using 1953 as base, find the price relatives 
corresponding to the years 1956 and 1958. (ii) using 1956 as base, find the price 
relatives corresponding to all the given years. (iii) using 
1953 — 1955 as base, find the Price relatives corresponding to all the given 


1956 


1958 | 


years, 
Years 1953 | 1954 1955 | 1956 
Average retail price 14.95 | 14.94 | 15.10 | 15.65 16.53 
Ans. 0 104.68, 110.57 (i) 95 53, 95.46, 96.49, 100, 104.03, 105.62 


(iii) 99.67, 99.60, 100.67, 104.33, 108.53, 110.20 
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Q.6 Compute chain index numbers for the following data taking 1997 as base year. 


Li a7 | 1998 


9 
18 


Prices 0, 


Ans: 100, 102,78, 107.78, 111.11, 113.33, 121.10, 122.21 


Q.7 Convert the following prices into price relatives using chain base method 
taking 1994 as base year. 


Prices 
Ans: 100, 125, 155.55, 177.78, 194.44, 222.23, 250.01 


Q.8 From the fixed base index numbers given below, prepare chain base index 
numbers taking 1990 as base year. 
1999 | 2000 


Years 4990 ] 1991 | 1992 | 1993 | 1994 | 1995 | 1998 

| oe 
Fixed indices | 120 | 116 | 120 | 120 | 137 149 | 156 149 
Ans. 100, 96.67, 103.45, 100, 114.17, 99.27, 109.56, 104.70, 87.82, 118.25, 91.98 


Q.9 The following are the prices of six different commodities for 1989 and 1990. 
Compute a price index by (i) simple aggregative method and (ii) average of 
price relative method by using both arithmetic mean and geometric mean, 
taking 1989 as base. 


Commodity [a B ) | E F 


| Ge lee 
Price in 1989 ( Rs.) E 60 | 20 | 50 so | 100 
iced 5 
Price in 1990(Rs.) | 50 | 60 30 | 70 | 90 u0 | 
Ans.(i) 117.14 (ii) 122.9, 121.70. 


Q.10 Compute index numbers of prices from the following data taking 1981 as base 
and using median as an average: 


Prices 


Ans. 100, 115.38, 126.92, 153.85, 
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Q.11 Given the prices of four commodities, construct price index numbers by using 
simple aggregative method taking (i) 1962 as base (ii) average of all years 
aggregate as base. 


Commodities ( prices in Rs.) 


Ans. (i) 100, 108.49, 96.67, 109.15 (ii) 96.55, 104.74, 93.33, 105.38. 
Q.12 Construct chain base indices for the following years taking 1940 as base: 


2.8 3.4 


Ans. 100, 114.27, 119.38, 128.72, 135.08 
Q.13 Use chain base method to construct index numbers: 


Items 1958 1960 1961 
12 


A 10 13.5 
B 16 16.5 17 
Cc 18 18.5 19 
D 20 21 21.5 


Ans. 100, 107.73, 113.28 
Q.14 Compute the chain indices from the following price relatives: 

Coffee 
1921 
1922 


1923 
1924 
1925 
Ans. 91,33, 97.22, 101.18, 99.81, 106.27 
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Q.15 Given the price relatives of three commodities, construct the chain indices 
using median as an average: 


Price relatives 


Ans. 100, 105, 109.2, 108.11, 114.66 


Q.16 Find the chain indices from the following price relatives of three commodities 
using the geometric mean. 


Commodities 


Year B Cc 


1996 17 


54 


Ans. 90.54, 75.39, 100.14, 102.41, 75.22 


Q.17 Compute the index numbers using simple aggregative method with 1952 as 
base peer 5) 


| Commodity | 1952 


Barley 


Jawar 


Ans: 100, 97.66, 105.41, 117.47 
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Q.18 Given the prices of three commodities, compute price index numbers by simple 
aggregative method taking 1998 as base. 


Commodities | 


Ans: 100, 90.30, 114.77, 123.21, 106.75 


Q.19 Compute chain index numbers from the following price relatives using 
geometric mean as an average and taking 2000 as base year. 


=a 
Commodities 


Year A B Cc 
2000 4 10 20 
2001 4 6 23 
2002 ef 3 25 
2003 8 10 30 


Ans: 100, 88.37, 125.34, 144.23 


Q.20 Construct the following weighted aggregative index numbers of prices for the 
year 1981 from the data given below: 


(i)  Laspeyre's index number (ii) Paasohe's index number 
(iii) Fisher's index number. 


Ans. (i) 114.04 Gi) 113.97 (iii) 114 
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Q.21 Given the following information: 


2002 


Commodity 


| Quantity 


2003 
Quantity 


Price 
93 100 
10 64 ll 
3 51 5 


Construction the following index numbers of prices for the year 2003 by 


taking 2002 as the base year. 
(i) Base year weighted 


(ii) Current year weighted, 
Ans. (i) 203.58 (ii) 203.47 
Q.22 Construct price index numbers for the year 1992 on the basis of the year 1987 


of the following by using Laspeyre's, Paasche's and Fisher's Ideal index 
number formulae: 
1987 


A B c 
Price Quantity Price | Quantity | Price Quantity 
5 10 8 26 6 | 13 
25s 
| 1992 4 12 Hf 27 5 | 14 


Ans. 85.42, 85.28, 85.35 


Year 


Q.23 Compute the index numbers by Fisher's Ideal formula for the data given 
below: 
=Zpiqeo = 8800, Tpogo = 7800, Lpiqi = 15400, Lpoqi = 13710, 
Lp2qgo = 11000, <pzq2 = 23000, Lpuqz = 16370 


Ans. 112.57, 140.76 


Q.24 From the following data, find index numbers for 2002 with 2001 as base-year 
by (i) Laspeyre's formula (ii) Paasche's formula. (iii) show numerically that 
Fisher's Ideal formula is the geometric mean of these two: 


Quantities 
2002 2001 2002 
75 270 290 
45 124 | 144 
c 18 21 190°) | iaa7 
ars lea 68 18 | 200 
Ans. (i) 116.51 (i) 116.47 Gil) 116.49 
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Q.25 The following data gives the prices and quantities of four commodities for the 


years 2000 and 2002. 
[commoaiy | e000 | oon | t000 | anc 
A 


300 310 
B 240 275 
Cc 148 
D 280 360 


Construct the index number of prices for the year 2002 taking 2000 as base 
year using Paasche's formula. 


Ans. 120.64 

Q.26 Given the following information: 
Lpigo = 41140, Zpogo = 35310, LXpigqi = 46707, Lpoqi = 40048, 
Zpzgo = 39644, Zpeq2 = 51724, Lpoq2 = 47376 


Compute: (i) Base year weighted price index 
Gi) Current year weighted price index 
(ii) Fisher's Ideal price index 
Ans. (i) 116.51, 112.27 ii) 116.63, 109.18 (ii) 116.57, 110.72 
Q.27An enquiry into the budgets of the middle class families of a certain city 
revealed that on average the percentage expenses on the different groups were 
food 45, rent 15, clothing 12, fuel and light 8 and miscellaneous 20. The group 
index numbers for the current year as compared with a fixed base period were 
respectively 410, 150, 343, 248 and 285. Calculate the consumer price index 
number for the current year. 
Ans. 325 


Q.28 Compute the cost of living index number by using weighted average of price 
relatives method. 


Commodity | Weights | Base year price in Rs. | Current year price in Rs. 


A ee: 
2 
5 
12 
6 


Ans. 188.83 
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Q.29 From the data given below construct consumers price index numbers of 1986 
on the basis of 1976 by using (i) aggregative expenditure method (ii) family 
budget method. Test whether the answer is the same by both the methods. 


8 14 


Wheat 4 
Rice 15 21 

Dal 10 Se gpa 

Oil 20 30 

Ghee 6 12 

Cereals 7 14 

Vegetables 


Ans, (i) 165.22 (ii) 165.22 


Q.30 Calculate the index number for 1990 on the basis of 1978, 


Prices 


Commodity Unit of price Quantity consumed 


Rs. per maund B seers 


25 
Clothing Rs. per yard 1.50 


Ans. 145.67 
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Q.31 Construct consumers price index numbers of 1980 on the basis of 1978 using: 
(i) aggregative method (ii) family budget method: 


Prices 


Quantity consumed in 1978 1978 1980 


6 maunds per seer 


10 maunds 


per maund 


3 maunds per maund 


5 maunds per maund 


5 seers per seer 


1 maund per maund 


Ans. (i) 115.79 Gi) 115.79. 


Q.32 An inquiry into the budgets of middle class families in a certain city gave the 
following information: 


Clothing | Fuel Miscellaneous 


‘00 
| Expenseson | 35% 15% 20% 10% 20% 
Expenses on 


Prices (1989) Rs. 1000 600 
Prices (1990) Rs. 1250 900 
What changes are observed in the cost of living index number of 1990 as 


compared with that of 1989? 
Ans. 126.1. 


Chapter 


SET THEORY 


6.1, INTRODUCTION 


Set theory which was developed by George Cantor (1845 — 1918) between 
1874 — 1895, is a basic mathematical tool that is used by various branches of 
Mathematics, such as probability theory, calculus and geometry. Cantor was born in 
Russia in 1845, but moved to Germany in 1856. In 1863, he entered the university of 
Berlin, where he studied under K. Weierstrass (1815 — 1897), L. Kronecker 
(1823 - 1891), and E.E. Kummer (1810 — 1893), all of whom are famous 
mathematicians, and received his Ph. D. in i867. One of the subsequent main works 
in set theory is by F. Hausdorff, which was first published in 1914. 

6.2. SET = 

A set is any well defined collection of distinct objects. The objects of the set are 
called its members or elements, We shall denote the set 'A' whose elements are ay, 
Ag, ag, “+, ag by A= { Aj, az, ag, +, an}. Sets are usually denoted by the capital letters 
A, B, C or X, Y, Z etc. and the lower case letters a, b, c and x, y, z etc. are used to 
denote the elements. Sets are always written within pair of brackets. For example 
the sets A and B are written as A = { 1, 2, 3 }, B = { Aslam, Anwar, Ali }. The 
element 1 is member of A is written as 1 € A (1 belongs to the set A), Similarly 
Aslam « B, Anwar « B. 

A set can be defined by actually listing all its elements or by describing some 
property of all the members. The first is called tabular form or roster method and 
the second is called property method. For example, a set V of all vowels in English 
alphabet can be written by tabular form as V = { a, e, i, 0, u } and by property 
method it can be written as V = {x/x is a vowel }. It is read as the set of all 
elements x such that x is a yowel. The line / is read as "such that", 

6.2.1. NULL SET (EMPTY SET OR VOID SET ) 

A set that contains no element is defined as the null set and is usually denoted 
by $. i.e, o= { }. It is also called empty set or void set. 
6.2.2, SINGLETON SET 

A set which has only one element is called singleton set. For example, 
A={Mangla Dam },B= { Punjab }, C= { The Ravi }. 
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6.2.3. FINITE AND INFINITE SETS 

A set is finite if it contains a specific number of different elements, i.e. while 
counting the members of the set the counting process comes to an end. Otherwise a 
set is infinite. Examples of finite sets are: 


A = {1,2,5} B= {xy2twv} 
C= {x /x is a river on the earth } D= { y/y is a month of the year } 


E= { z/zis a vowel in English alphabet } 
Examples of infinite sets are: 


A = {2,468,..} B= {x/0<x<2} 
c= {y/y isa star in the sky } D= {z/zisa point ona line } 
E = { w/w is an odd number } 

6.2.4, SUBSET 


If every element of a set A is also an element of a set B, then A is called subset 
of B, and we shall write Ac B or B3 A; read as "A is contained in B" or "B contains 
A" 

If A is subset of B and B is subset of A, then we write 

Nee xin =) stage ak or A=B 

If A is subset of B, but A is not equal to B, we say that A is a proper subset of 
B. For example, if A= { a,i,u}andB= { a,e,i,o,u }, then A is proper subset of B. 
6.2.5. UNIVERSAL SET OR SPACE SET 

‘A set which contains all elements of all sets under consideration is called 
universal set or space set and is denoted by U or S. 

if A = {1,2,3},B = {3,4,5,6}, C = {10,8}, D = {9,6 

thenU = S = {1,2,3,4,5,6,8,9, 10}. 

6.2.6. IDENTICAL SETS OR EQUAL SETS 

Two sets A and B are said to be identical or equal if they contain exactly the 
same elements. 

If A = { e258; 4}, B= {4, 1, 2,3 }, A and B are called equal sets. 
6.2.7. VENN DIAGRAM 

Venn diagram is a diagram in which universal set S is represented by a 
rectangle and sets are represented by circles or parts of circular regions. 
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6.3. OPERATIONS ON SETS 

Given a universal set S and its subsets, A, B, C, ..., we can perform on these 
subsets certain operations that produce other, or perhaps the same, subsets. Among 
all the operations on sets, we are interested only in union, intersection, difference 
and complementation. 
6.3.1. UNION OF SETS 

Union of the two sets A and B is denoted by Au B. If A and B are two sets, 
then A U B is a set which contains those elements which belong to A or B, or both, 
ie., 


AUB= {x/xeA or xeB} 


Let A = {1,2,3} Let A = {2,4,6} 
and B = {3,4,5} and B = {8,9, 10} 
then the union of A and B is then the union of A and B is 
AUB= {1,2,3,4,5} AUB= {2,4,6,8,9, 10} 
Venn diagram 
A B (eri B 
AUB is shaded area AWB is shaded area 
Fig.6.2.(a) Fig.6.2.(b) 


6.3.2. INTERSECTION OF SETS 


’ 
Intersection of the sets A and B is denoted by Aj B. If A and B are two sets, 


then An B is a set which contains those elements which are common to both A and 
B, ie., 


ANB= {x/xeA and xe B} 

Let A={1,3, 4,5} andB={2, 3, 4, 6}, then the intersection of A and B is: 
ANB= {3,4} 

Venn diagram 


A Bis shaded area 
Fig.6.3. 
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6.3.3. DIFFERENCE OF TWO SETS 

If A and B are two sets, then A — B is a set which contains those elements of A 
which are not in B, ie. A-B = {x/xeA and x¢B} 

Let A = {1,2,3,4} and B={3, 4,5, 6}, thenA-B = {1,2} 

Venn diagram 2 


A~B is shaded area 
Fig.6.4. 


6.3.4. COMPLEMENT SET 


If A is a set and S is a universal set, then A is a set which contains those 
elements of S which are not in A, i.e., 


A = {x/xeS and xeA} 
let S = {1,2,3,4,5,6,7,8,9,10} and A= {1,2,3}, then 


A = {4,5,6,7, 8,9, 10} 


Venn diagram 


A is shaded area 
Fig.6.5. 


6.4. DISJOINT OR MUTUALLY EXCLUSIVE SETS 


Two sets A and B are said to be disjoint or mutually exclusive if they have no 
elements in common, ie., ANB = 6. 


Let A={1,2,3,4} and B={7,8,9, 10}, then ANB=$. 


[Chapter 6] Set Theory 249 


Venn diagram 
A B 
e : 
ANB= 
Fig.6.6, : 
6.5. CLASS OF SETS A 


If the elements of a set E are also sets, then the set E is called a class of sets or 
a set of sets is called a class of sets. ja 


Let A = {1,2}, B={2,3,4,5}, C={1,9} and D={6, 7,8, 10} 
and E = {A,B,C,D}, thenBis called class of sets. 
6.6. POWER SET 


The set of all possible subsets of a set 'A' is called power set of A and is denoted 
‘by P(A). 


Let A = {1,2,3} 
There are 2°= 8 subsets from a set of 3 elements, i.e., 
Ai= {1}, A= {2}, 
A, = {3}, A,={1,2}, A,={1,3} 
Ac= {2,3}, Ar= {1,23} and A,= ean 
Then P(A) = {Aj, Ay, Ag, Ay, As, Ag, Ar Ag} 
6.7, PRODUCT SET 


If A and B are two sets, then A x B contains all those elements of the type (x, y) 
where x e Aandy &B,i.e., 


AxB = {@y)/xeAyeB} 
let A = {1,2,3}-and B = {w,x}, then 4 
AxB = {(1,w), (1%), @,w), 2», @, w), (3, 9} 
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Tree diagram 


A . B AxB 
w (1, w) 
1 
% (1, x) 
w (2, w) 
2 
x (2, x) 
w , G8) 
3 
x (3, x) 
Fig.6.7. 


In general Ax B is not equal to Bx A. 
6.8. PARTITION OF SET 


If S is a universal set, and we divide S into different disjoint and non empty 
subsets, it is called partition of set S. 


S = {12,34}, A= {1}, 4 = {23}, As= {4} 
A,, Ay and A; form partition of S, ie., 
@ AMA, = 6 forall izj 
Gi) A;\UA,UA,U..UA, = 8 
Example 6.1. 
If S = {1,2,3,4,5,6,7,8,9, 10} 
Consider the subsets of S given as: 
A = {1,2,3, 10}, B={2,4,6,8},C={1,3, 5,7} and D={8, 9, 10}. 
Thenfind (@) AUB @i)AUC Gi) ANC Gv) COD 


® BoD wid (wii) 6. 
Solution: 
@) AUB = {1,2,3,4,6,8,10} Gi) AUC= {1,2,3,5,7, 10} 
Gil Ane Sefer (iv) CAD= 6 
(v) BAD = {8} (i)D = {1,2,3,4,5,6,7} 


(vii) C = {2, 4,6, 8,9, 10} 
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Example 6.2. 
Let S = {1,2,3,4,5,6,7,8,9}, A={1,2,3, 4}, 


B= 424,68} and C={3, 4,5, 6}. 
Find@ A GD Gi) (ARC) Gy) (AUB) W) (A) w)(B=C) 
Solution: 
(i) A = {5,6,7,8,9} 
Gi) B =. {953,579} 
Anc. = {3,4} 


dil) (ANC) = {1,2,5,6,7,8, 9} 
AUB = {1,2,3,4,6,8} 


(iv) (AUB) = {5,7,9} 
v) @ = {1,234} = A 
(B-C)= {2,8} 
wi) (B-C)= {1,3,4,5,6,7,9} 
Example 6.3. 
let A={2,3}, B={3,4} and C={4} be subsets of the universal set 
$={2,3)4 }. Determine the sets. 


@ AxA Gi) Ax B Gii) Bx A Gv) (AxB)A(BxC) 
(v) (AxB)U(BxC)_ 
Solution: 


@) AxA { (2, 2), (2, 3), (3, 2), (3, 3)} 
Gi) AxB = {(2,3),@, 4), @, 3), @, 4)} 
(ii) BxA = {(3, 2), (3, 3), (4, 2), (4, 3)} 
BxC = {(3,4),(4,4)} 
(iv) (AxB)A(BxC)= {3,4} 
(v) (AxB)U(BxC)= {(2, 3), (2, 4), G, 3), (3, 4), 4 4)} 


Example 6.4. 
Given S = {k,1,m,n,o,p}, A= {k,1}, B = {m,n,o,p}, C={o,p}. 
Find (i) c Gi) AUC (ii) AUB (iv) ANB 


@) AUS  (Wi)SAB_ (ii)(AUB) wii) (AB) 
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Solution: 
Giae = {k,]lmn} Gi) AUC = {k,Lo,p} 
Gi) AUB = {k,lm,n,o,p} GvJANB= 6 


(v) AUS = {k,Lm,no,p}= 8 (Wi)SOB= {m,n,o,p} = B 
(vii) (AUB) = .6 
(viii) A = {m, n, 0, p} B = {k, 1} ANB = > 


@nB) = {klmnop}= § 
Example 6.5, 

Let A = {1,2,3}, B={2,4} ana C={8, 4,5}. FindAx Bx. 
Solution; 


A convenient method of finding A x B x C is through the so called "tree 
diagram" shown below: 
AxBxC 
(i, 2, 3) 


(1, 2,4) 
(1, 2, 5) 
, 4, 3) 
(1, 4, 4) 
(1, 4, 5) 
(2, 2, 3) 
(2, 2,4) 
(2, 2, 5) 
(2, 4, 3) 
(2, 4, 4) 
(2, 4, 5) 
(3, 2, 3) 
(3, 2, 4) 
(3, 2, 5) 
(3, 4, 3) 
3,4, 4) 
(3, 4, 5) 


w nN 
nN a cs) 
Vk YUH RUN ROH BUH A WU WOH 
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6.9. ALGEBRA OF SETS 


Having defined the operations of union, intersection, difference and 
complementation for sets, we may formulate an "algebra of sets". Many of the basic 
laws of set theory are analogous to the rules of algebra of real numbers, 


(1) Identity Laws 

Gi) AUb=A Gi) AN? =o ii) AUS=S (iv) ANS =A 
(2) Idempotent Laws 

G@) AUA=A Gi) ANA=A 
(8) Associative Laws 

@ (AUB)UC = Au(BUC) 

Gi) (ANB)AC = An(BOC) = ANBNC 
(4) Commutative Laws 

@ AUB= BUA Gj) ANB= BoA 
(5) Distributive Laws 

@ AUBNAC)=AUB)A(AUC) Gi) An (BUC) =(ANB)U(ANC) 
(6) Complement Laws 


@ AVA= 8 fi) ANA = 6 
Gi) @ = A @)ANO= 6 


(7) Demorgan's Laws 

@ (AUB) = ANB (i) (AQB) = AUB 
6.10. MULTIPLICATION PRINCIPLE 

The multiplication principle states that if an operation can be performed in n, 
ways, and then after it is performed in any one of these ways, a second operation can 
be performed in ng ways, and after this second operation is performed in any one of 
these ways, a third operation can be performed in Ny ways, and so on for k 
operations, then the k operations can be performed in 

: Ny X Ny X Ny X ... XN, Ways 

For example, in how many ways can the three letters A, B and C be arranged? 

By the principle of multiplication, we have 
MmyXngxng = 3x2x1 = 6 ways 

If a departmental store has five entrances and six exits, in how many ways can 

one enter and leave the store? Here, we write 


nxng= 5x6 = 30 ways 
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6.11. FACTORIALS 

If nis a positive whole number, the product n(n-—1)(n—2)... 3.2.lis called 

‘factorial n' and is denoted by the symbol n!. So 
n! = n(n-1)(n—-2)...(3)(2)(1) 

In particular, we see that 

l= 1 
2) = 2x1 = 2 
3! = 3x2xl = 6 
44 = 4x8x2x1l = 24 
10!= 10x9x8x7x6x5x4x3x2x1= 3628800 

Electronic calculators have the facility of calculating factorials, Zero factorial is 
equal to 1. 

Thus 0! = 1, but (-4)!, is not defined and has no meaning. 

6.12. PERMUTATIONS . 

A permutation of a number of objects is an arrangement of these objects in a 
definite order. The number of permutations of a set of n things, taken r at a time is 
represented by "P, and is given by 

5p. = aa = afn-1)(-2)... (n—r+ 1) ways 

For example, in how many different ways can the three letters a, b and c be 

arranged by taking two at a time? The answer is 
1 
*pi= ree = 6 ways 

These permutations are ab, ba, ac, ca, be, cb. 

The number of permutations of a set of n things, taken all at the same time, is 
n!. Denoting this number by "P,, we have 

n! n! 
Gas mpluarsan0! aan 

For example, in how many different ways can the 10 digits 0 to 9 be arranged 
by taking all of them at a time? The answer is 

MP. = 10!= 10x9x8x7x6x5x4x3x2x1= 3628800 

The number of permutations of n things consisting of groups among which n, 

are of one kind, nz of a second kind, ng of a third kind and so on, n, of a kth kind is 
n! 


ap = 


RE iat where n} +ng+nj+...+ny= n 


For example, in how many ways can the letters of the word "Samasatta" be 
arranged. Here, total number of letters is 9, 's' occurs 2 times, ‘a’ occurs 4 times, 't' 
occurs 2 times 'm' occurs 1 time. So the number of permutations is 

9! 9.8.7.6.5.4! 4 
Davos OT Algehde gc OAee Ways 
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6.13. COMBINATIONS 


A combination is a selection of objects considered without regard to their order. 
The total number of combinations of a set of n things taken r at a time, with n 2 r, 


usually denoted by °C, or by (a) is 


°C = ad % n! _ n(n-1)(m-2)... (n-r+1) 
Sane fe ~ (n=)! r! 


The number of combinations of the letters a, b and c taken two at a time is 
>) be 
i eh 
These combinations are ab, ac, be. 
Note: Here, ab is the same combination as ba. 


A basket ball squad has 10 players. The coach must select a starting team. 
How many different teams of five players can be selected for this purpose? Here, we 
have 


10 10! 10x9x8x7x6x5! Pees 2 
ee Ses aa = 5lx5x4x3x2x1 ~ 252 combinations 
The students are advised to remember the following relations 
n n! 
, = (7) = (n-1y = 2 
nalts F) Epon oe pchyeeamle 
ee é ni(n-ny = nto = ? 
coon (08) = atts - ©) 
Cn-» = bess ~ r(n-r)! <r: 
0! = 1 
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EXERCISES 


Q.1 If S={0, 1, 2,3, 4,5,6,7,8}, A={0, 1,3, 5,7}, 
B={0, 2, 4,6,8},C={1,2,3,4} and D={0,5, 6}. 


list the elements in the following sets. 


@) ANB (i) CUA ii) D (iv) (CAD) UB, 
(SAA) (wi) ANBOD. 
Ans. (i) {0} di) {0,1,2,3,4,5,7} ii) {1, 2,8, 4, 7, 8}, 


(iv) {0, 1, 2, 3, 4, 6, 8} (v) {2, 4, 6, 8} (vi) > 


Q2 If S = {1,2,3,4,5,6,7,8,9}, A= {2,4,7,9}, B={1,3,5, 7,9}, 
c={2,3,4, 5} and D= {1, 6, 7}. List the elements of the sets corresponding 
to the following events: 


@), 45.06 @ BoC (ii) (SAB) (vy) G@aD)uB 
(Vv) (BAG)VA Wi) ANCAD 

@ {1,2,3,4,5,6,8} Gi) {1,7,9} ii) {1,3,5,7,9} 
(iv) { 1, 3, 5, 6, 7, 9} (v) {1, 2, 4, 7, 9} (vi) { 2, 4} 


Q3 Let S = {3,4,5,6,7,8,9, 10, 11,12}, A= {4,5,6},B={5,6,7} and 
C= { 7,8,9 y List the members of the following events 


An: 


a 


@ AnB G)AUB @)E@NB) “GARBVOQ ws 
Ans.(i) {7} (i) {3,5, 6,7, 8,9, 10, 11,12} ii) { 4, 5, 6, 7} 
Gv) {3, 4, 7, 8,9, 10, 11,12} w { } =o 
Q4 Let S={a,bcodefghi,j}, A={a,b,c,d}, B={d,e,f}, 
C={a,c,e,i,j}. Find @) AAC Gi) BUC Gi) © AA 


(iv) ANS (vy) (AUB) 
Ans. (i) {a,c} (i) {a,c,d,e,f£i4j} Gi) {£e,h} Gv) {a,b,c a} 
() {g,h, ii}. 
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Q.5 Consider the sample space 
si { bus, train, aeroplane, boat, bicycle, automobile, motorcycle } 
and the events A = { bus, train, aeroplane "i 
B= { train, automobile, boat}, C= { bicycle }. 
List the elements of the sets corresponding to the following events: 
(@) (A UB) Gi) (BNE)nA Gil) (A UB)A(A AG) 
Ans, (i) { train, boat, bicycle, automobile, motorcycle } (ii) { train } 
(iii) { boat, automobile, motorcycle } 
Q.6 If S={x/0<x<12},A={x/1<x<9} andB={x/0<x<5}, find: 
@) AUB Gi) ANB (ii) A AB 
Ans. (i) {x/0<x<9} (i) {x/1<x<5} (ii) {x/8<x<12} 
Q.7 Let A={2,3},B= {1,3,5} and C= { 3, 4}. Construct the tree diagram of 
Ax Bx Cand then find Ax Bx C. 
Ans. (2, 1, 3), (2, 1, 4), (2, 3, 3), (2, 3, 4), (2, 5, 3), (2, 5, 4), (3, 1, 3), (3, 1, 4), 
(3, 3, 3), (8, 3, 4), (3, 5, 3), (3, 5, 4). 
Q.8 Let S={a,b,c}, T={b,c,d} and W= {a,d}. Construct the tree diagram 
of S x T x W and then find § x T x W. 
Ans. (a, b, a), (a, b, d), (a, ¢, a), (a, ¢, d), (a, d, a), (a, d, d), (b, b, a), (b, b, d), (b, c, a), 
(b, ¢, d), (b, d, a), (b, d, d), (¢, b, a), (¢, b, d), (, ¢, a), (¢, c, d), (c, d, a), (c, d, d) 
Q.9 Let A={6,7}, B= {7,8} and C= { 8 } be subsets of the universal set 
S={ 6, 7, 8}. Determine the elements of the following sets: 
@ AxC (i) AxB G@i)BxA (iv) (Ax B)U(AxC) 
(v) (AxB)A(AxC). 
Ans. (i) { (6, 8), (7, 8)} Gi) {6 7, 6 8), (7, 2), (7, 8)} 
(iii) { (7, 6), (7, 7), 8, 6), (8, 7)} Gv) {(6, 1), (6, 8), (7, 7) (7, 8) } 
©) {6 8), (7, 8)} 


Q.10 There are five different roads connecting two towns A and B. In how many 
ways can a person go from A to B by one road and return by another? 


Ans. 20 ways 


Q.11 How many lunches are possible consisting of soup, a sandwich, dessert and a 
drink if one can select from 5 soups, 2 sandwiches, 4 desserts and 3 drinks? 


Ans. 120 ways 
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Q.12 There are five doors in a room. Three persons have to enter it. In how many 
ways can they enter from different doors? 

Ans. 60 ways 

Q.13 How many sample points are in the sample space when a pair of dice is thrown 
once? 

Ans. 36 ways 


Q.14There are six seats available in a compartment. In how many ways can six 
persons be seated? 


Ans. 720 ways 


Q.15 Two lottery tickets are drawn from 25 for first and second prize. Find the 
number of sample points in the sample space S. 


Ans. 600 sample points 
Q.16In how many different ways can a 11 ~ men football team be chosen from a 
squad of 20 men if the positions are not ignored. 


Ans. 6.7044257 x 10! ways : 

Q.17 How many possible permutations can be formed from the word Statistics? 
Ans. 50400 ways 

Q.18 In how many ways can the letters of the word "Mathematics" be arranged? 
Ans, 4989600 ways 

Q.19 How many possible permutations can be formed from the word Economics? 
Ans. 90720 ways 

Q.20Evaluate (i) *P, (ii) "Ps (ii) “P, (iv) "Ps (v) ®P, 
Ans. (i) 3024 Gi) 2520 = (iii) 240 : (iv) 120 (v) 48 


6\(4 7\(6 
Q.21Evaluate (i) (2) Gi) (s)(3) ) ) (iii) GG) ‘n ) 
5 8 4)\( 48 
ty 9 BF) oO 
Ans. (i) 70 (ii) 0.3810 (iii) 0.4079 ~— (iv) 0.0068 (v) 0.0026 (vi) 56 
Q.22 In how many ways can a committee of 4 people be chosen out of 8 people? 
Ans. 70 


Q.23 A boy has five coins each of different denomination. How many different sums 
of money can he form? 


Ans. 31 


Chapter 


7 
PROBABILITY 


7.1. INTRODUCTION 

We live in a world of uncertainties, Man is surrounded by situations which are 
not fully under his control. The nature commands these situations. A person on a 
road does not know whether or not he will reach his destination safely. A patient in 
the hospital is never sure about his survival after a delicate operation. What will be 
the weather conditions tomorrow? A flight will be late or will reach in time, the road 
will be clear or there will be some traffic jam; we face such problems in our daily life. 
Man is always curious to know as to what will happen in future. The events which 
will happen in future are important for the man today. These events are based on 
what is called chance or probability. The numerical measure of uncertainty is called 
probability. We may find a numerical measure for a bulb to be defective, numerical 
measure for the rain to fall. The belief or confidence associated with a certain 
situation can also be measured. It is also called probability. In statistics, there are 
various situations where uncertainty is involved. Such situations need the 
application of probability. Probability is widely and rightly used in statistical 
decisions. The areas of statistics; where probability is used are called the areas of 
statistical inference. Statistical inference is not possible without the use of 
probability. Probability is used in all fields of life where uncertainty prevails. The 
knowledge of probability is used in space research, astronomy, business, weather 
forecast, economies, genetics and various other fields of life. It is simple to explain 
various concepts of probability with the help of set theory. Thus we shall use her> 
the set theory notation which has been discussed in the previous chapter. 


7.2, RANDOM EXPERIMENT 

The word experiment or random experiment is used for a situation of 
uncertainty about which we want to have some observation, The actual result of the 
uncertain situation is called outcome or sample point. In the random experiment, 
nothing can be said with certainty about the outcome. An experiment may result in 
different outcomes, even though it is performed under similar conditions, The term 
random trial or simply trial is used, if an experiment is performed only once. A bulb 
may be selected from a factory to examine whether it is defective or not. The 
selection of a single bulb is a trial. We can select any number of bulbs. A random 
experiment has the following properties: 
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(i) The experiment can be repeated any number of times. We may select one or 
more than one items for inspection. The number of repetitions is called the size 
of the experiment. In statisties, the size of the random experiment plays a 
major role in statistical inference. 

(ii) A random trial consists of at least two possible outcomes. If a basket contains 
all the defective bulbs, a selected bulb will be certainly defective. It has only 
one possible outcome. It is not a random trial. If the basket contains some good 
and some defective bulbs, a selected bulb will be good or defective. In this case 
there are two possible outcomes. Thus selecting a bulb from such a basket is a 
random trial. 

(iii) Nothing can be said with certainty about the outcome of the random trial or 
random experiment. If a sample of four bulbs is selected, may be one bulb is 
defective. When another sample of four bulbs from the same lot is selected, may 
be all the bulbs are defective. Thus the result of the experiment cannot be 
predicted even if the experiment is repeated a large number of times. 

7.3. SAMPLE SPACE 
A complete list of all possible outcomes of a random experiment is called 

sample space or possibility space and is denoted by S. Each outcome is called 

element of the sample space. A sample space may contain any number of outcomes. 

If it contains finite number of outcomes, it is called finite or discrete sample space. 

When two bulbs are selected from a lot, the possible outcomes are four which can be 

counted as 

(i) both bulbs are defective (i) first is defective and second is good 

(iii) first is good and second is defective. (iv) both are good. 

Here the sample space is discrete. When the possibilities of the sample space 
cannot be counted, it is called continuous. The number of possible readings of 
temperature from 45°C to 46°C will make a continuous sample space, 

Sample space is the basic term in the theory of probability. We shall discuss 
some sample spaces in this chapter. It is not always possible to make the sample 
space. If it contains very large number of points, we cannot register all the outcomes 
but we must understand as to how we can make the sample space. The outcomes of 
the sample space are written within the brackets { }. Some simple sample 
spaces are discussed below: 

(i) A Coin is Tossed 
When a coin is tossed, it has two possible outcomes. One is called head and the 

other is called tail. Anyone of the two faces may be called head. To be brief, head is 

denoted by H and tail is denoted by T. Thus the sample space consists of head and 
tail. In set theory notation, we can write S as: 
S = {head, tail} or S = {H,T}. 

(ii) Two Coins Tossed 
When two coins are tossed, there are four possible outcomes. Let H, and T, 

denote the head and tail on the first coin and H, and T, denote the head and tail on 
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the second coin respectively. The sample space S can be written in the form of a 
table as below: 


Second coin 
First coin Ha | T2 


mi (HiR2) (HiT2) 
Th (TiH2) (TiT2) 
This sample space can also be written as 
S = {(H,H,), (T\H,), (H,T,), (T,T,)} 

It may be noted here that a sample space of throwing two coins has 4 possible 
points. A sample space of 3 coins will have 2 = 8 possible points and for n coins, the 
number of possible points will be 2", 

(iii) A Die is Thrown 

An ordinary die which is used in games of chances has six faces, These six faces 
contain 1, 2, 8, 4, 5, 6 dots on them, Thus for a single throw of a die, the sample 
space has 6 possible outcomes which are : 

Si Soqed.Ba4lsei 
(iv) Two Dice Thrown 


A die has six faces. Each face of the first die can occur with all the six faces of 
the second die. Thus there are 6 x 6 = 36 possible pairs or points when two dice are 
thrown together. These 36 pairs are written below in Table 7.1. 


Table 7.1. 
if Second die 
3 4 5 6 

(1, 3) (1, 4) (1, 5) (1, 6) 
(2,1) (2,2) (2, 3) (2, 4) (2, 5) ( 2, 6) 
(3, 1) (3, 2) (3, 3) (3, 4) (3, 5) (3, 6) 
(4, 1) (4, 2) (4, 3) (4, 4) (4, 5) (4, 6) 
(5, 3) (5, 4) (5, 5) (5, 6) 
(6, 3) (6, 4) 

If 3 dice are thrown, the sample space will have 6° = 216 possible outcomes 
where each outcome is composed of three digits. 
7.8.1. EVENT OR SUBSET 


Any part of the sample space is called an event or a subset. An event may 
contain one or more than one outcomes. When an event consists of a single outcom 
(sample point), it is called a simple event. An event which has two or more outcomes 
is called a compound event. The sample points contained in an event are ritten 


within brackets { }. If we consider a single face when a die is thrown. it is a sunir 
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event. Getting 6 on a die when thrown, is called a simple event. If the event is any 
prime number on the die, the event consists of the points { 2, 3, 5 }, which is a 
compound event and consists of three simple events which are { 2}, {3 } and { 5}. 
When two dice are thrown, the pair (1, 1) is a single outcome in the sample space S 
and is therefore a simple event. The event "total is 3" consists of two outcomes that is 
(1, 2) and (2, 1). Thus "total is 3" is a compound event. 

If a random experiment can produce n sample points, it has n simple events. 
Throw of a single die has 6 simple events and a throw of two dice produces 36 simple 
events. 

The empty set > is also an event but it is not a simple event. 

The sample space S is a compound event and is called a certain event. 

7.3.2. TOTAL NUMBER OF EVENTS 

If a sample space has n sample points, then there are 2" events which can be 
defined on this sample space. The number 2” includes the empty set, n simple events 
and all possible compound events. Suppose two babies are born in a family. Let M 
denote the male baby and F denote the female baby. The sample space has four 
possible outcomes which are 

Ss = {(MM,), (MiF2), (FiM2), (FiF2)} 
The subscripts 1 end 2 stand for the first and second baby respectively. 


All possible events which can be defined on this sample space are given in the 
following table. 


Nature of Event Outcomes 1; ae of 
Empty set o 1 
Simple events {M,M,}, {M,F, },{F,M,}, {FF} 4 
Compound events {( M\M,), (M,F,) } {( M,M,), (F,M,) } 

{(M,M,), (F,F,) }, {(M,F,), (F,M,) } 6 


{(MF,), (F,F,) }, (CFM), (FiF,) } 
{(M,M,), (MyF,) (F,M,) } 
{(M\M,), (MF), (F,F,) } 4 
{(M,M,), (F\M,), (F,F,) } 
{(M,F,), (F,Mz) (F,F,)} 
{(M,M,), (M,F,), (F\M,), (F,F,)} 
Total possible events = 2" = 24= 16 
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Example 7.1. 

Two coins are tossed. Write all possible points of the sample space. Write the 
following events: 

A, — both are heads A, — head on the first coin 

A, — a head and a tail appears A,— at least one head appears 

A; — both coi1s have the same faces A, — head on the first coin and tail on 

the second coin 

Which of these events are simple? 
Solution: 

The sample space S is 


5 = {(H,H,), (H,T,), (7A), (T,T,)} 
The outcomes in the events are 
4,= {(1,H,)}} A, = {(HH,), (H,T,)} 
A, = {(H,T,), (T\H,)} A, = {(4,H,), (H,7,), (1,H,)} 
A, = {(H,H,), (7,7,)} A, = {(H,T,)} 


The events A, and A, are simple and all other events are compound. 


7.3.3. EQUALLY LIKELY OUTCOMES 

The outcomes of a sample space are called equally likely if all of them have the 
same chance of occurrence. It is very difficult to decide whether or not the outcomes 
are equally likely. But in this book we shall assume in most of the experiments that 
the outcomes are equally likely. We shall apply the assumption of equally likely in 
the following cases. 
(i) Tossing a Coin or Coins 

When a coin is tossed, it has two possible outcomes called head and tail. We 
shall always assume that head and tail are equally likely if not otherwise 
mentioned. For more than one coin, it will be assumed that on all the coins, head 
and tail are equally likely. 
(ii) Throwing a Die or Dice 

Throw of a single die can produce six possible outcomes. All the six outcomes 
are assumed equally likely. For any number of dice, the six faces are assumed 
equally likely. 
(iii) Playing Cards 

There are 52 cards in a deck of ordinary playing cards, All the cards are of the 
same size and are therefore assumed equally likely, 
(iv) Drawing Balls from a Bag 

There are many situations in probability +1 which some objects are selected 
from a certain container. The objects of the cuutainer are assumed to be equally 
likely, A famous example is the selection of a few balls from a bag containing balls of 
different colours. The balls of the bag are assumed to be equally likely. 
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7.3.4. NOT EQUALLY LIKELY OUTCOMES 

When all the outcomes of a sample space do not have equal chance of 
occurrence, the outcomes are called not equally likely. When a matchbox is thrown, 
all the six faces are not equally likely. If a bag contains balls of different sizes and a 
ball is selected at random, then all the balls are not equally likely. 
7.3.5. MUTUALLY EXCLUSIVE EVENTS 

Two events are called mutually exclusive or disjoint if they do not have any 
outcome common between them. If the two events A and B are mutually exclusive, 
then A > B = $ (null set). For three mutually exclusive events A, B and C, we have 
AOBOC=6$. Suppose there is a sample space S as: 


{1, 2, 3, 4, 5, 6, 7, 8, 9, 10} 
Let A = {multiplesof3}= {3,6,9} 


B { multiples of 5} = {5, 10} 

Here A B= 69. Thus A and B are mutually exclusive events. Both A and B 
belong to the same sample space but they are completely different and both cannot 
happen at the same time. A class of students may contain first divisioners, second 
divisioners and third divisioners. When a student is selected from the class, he will 
be any one of the three groups of students. Thus three groups of students are 
disjoint or mutually exclusive. When the two events A and B are mutually exclusive, 
we can show them with the help of a venn diagram. The venn diagram in Figure 7.1. 
shows that AN B=9. 


i 


ANB=$6 
, Fig. 7.1. A and B are Mutually Exclusive Eveats 

7.3. 6. NOT MUTUALLY EXCLUSIVE EVENTS 

The events are called not mutually exclusive if they have at least one outcome 
common between them. If A and B are not mutually exclusive events, then 
A B+ 6. Similarly A, B and C are not mutually exclusive events if AW Bn C # . 
Thus they must have at least one common point between them. Consider a sample 
space : 

S = {1,2,3,4,5,6, 7,8, 9, 10,11} 

{ 2,3, 5,7, 11} 


{ 1, 3,5, 7,9, 11} 


Let the event A = {prime numbers } 
and B = {odd faces} 
Here AnBe= {a,5 } 
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Thus ANB#9 ie, AV B exists. Here, A and B are not mutually exclusive 
events. A 4 B consists of outcomes which are common to both A and B. Figure 7.2. 
shows a venn diagram in which A and B are not mutually exclusive events. Some 
area under A is common with B. If the event A is a part of the event B, then 
AO B=A. This is shown in Figure 7.3. 


ANB 
Fig. 7.2. A and B are Not Mutually Exclusive Events 


Sok 


ANB=A 
Fig.7.3. A and B are Not Mutually Exclusive Events 

7.3.7. EXHAUSTIVE EVENTS 

When a sample space S is partitioned into some mutually exclusive events such 
that their union is the sample space itself then the events are called exhaustive events 
or collectively exhaustive events. Suppose a die is tossed and the sample space is 

S= {1,2,3,4,5,6}. Let A = {1,2} B = {3,45} C = {6} 

Here the events A, B and C are mutually exclusive because A 1 Bm C = 6 and 
AUBU C&S, Figure 7.4. shows three events A, B and C which are exhaustive. 


AnBaC=$ and AUBUC =S 
Fig. 7.4 
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7.3.8. AORB 
The term A or B is very important in probability theory. The term 'A or B' is 

used for two events A and B when they are mutually exclusive and we are interested 

to know the probability of the event "A or B". For this purpose the symbol A vu B is 
used, It must be remembered that the symbol A U B is also used for the event "A or 

B or both", when A and B are not mutually exclusive. 

7.3.9. A AND B 
The term 'A and B' is used for the event which consists of the points which are 

common to both A and B. In set theory language, the symbol A - B is used for A and 

B. The symbol A > B means intersection of A and B. 

7.3.10. REDUCED SAMPLE SPACE 
Sometimes the sample space S is reduced in size and is called reduced sample 

space. The symbol S; may be used for a reduced sample space. Suppose a die has 

been thrown and we have been informed that the experiment has produced an even 
face. This type of information is called the additional information. Thus the reduced 
sample space is determined by additional information. In this example the 
information has disclosed that even face has occurred. If it becomes known as to 
which even face has occurred, then it is no more a situation of probability. When the 
information is that the face is even, then there is still something hidden from the 
experimentor. The actual outcome is not known to the observer. In this case the 
reduced sample space S; is 

Ss: = {24,6} 

7.3.11. RELATIVE FREQUENCY 
The term relative frequency is used for the ratio of the observed frequency of an 

outcome and the total frequency of the random experiment. Suppose a random 

experiment is repeated N times and an outcome is observed f times, then the ratio 

f/N is called the relative frequency of the outcome which has been observed f times. 

Some examples of relative frequencies are given here : 

(i) We select bulbs from a certain big lot to examine whether they are good or 
defective. We take, say 100 such bulbs and examine them. Sixty bulbs are 
found defective. The symbol N may be used for 100 an:. the symbol f may be 
used for the observed frequency which is 60. Thus the 
Relative Frequency = f/N = 60/100 = 0.6 

(ii) We are interested to know whether a coin is unbiased (true) or not. We toss the 
coin say 200 times and note the number of heads. In 200 tosses, the number of 
heads may be, say 110. The relative frequency of this experiment for number of 
heads is 110/200 which is not 1/2. As we shall see later, the probability of head 
is usually written as 1/2. It is just an assumption and of course a big 
assumption. If we repeat the same experiment again, the number of heads may 
be less than or more than 110 as observed in the first experiment. This is what 
happens in random experiments 

(iii) A die is thrown and we are interested in the ace (face 1). We throw the die say 
60 times and ace is observed 1° nes. Thus the relative frequency of aces is 
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12/60 = 1/5. For an ideal die one should expect that the number of aces would 
be 60/6 = 10. At some later stage, we would like to know more about the ratio 
12/60. This ratio is not something constant. A next random experiment with 
the same die may produce a completely different result. 

(iv) The families having four children are being studied. Our study is based on 500 
families. We have counted ,the families who do not have a male child. This 
observed frequency is 50, We have also counted the number of families having 
1, 2, 8 and 4 male children, the number of families being 100, 200, 105 and 45, 
Thus the observed frequencies of families having 0, 1, 2, 3, 4 male children are 
50, 100, 200, 105 and 45. This information and the relative frequencies can be 
written in the form of a table below: 


Table 7,2. 
Number of Number of families Relative 
male children (f) frequencies 
0 50 500 = 0.10 
OOK ees, 
1 100 500 0.20 
200 
2 200 500 = 0.40 
105 
3 105 500 = 0.21 
45 
4 45 500 ~ 0:09 
L____ Total 500 1.00 


In Table 7.2. we find the relative frequencies which are not equal. The students 
should not expect that the relative frequencies should be equal. 
7.3.12. COMPLEMENTARY EVENTS 


If A is an event defined in the sample space S, then $ — A is denoted by Aand 
is called complement of A, 


Thus A = sok or AVA= 5S 
Figure 7.5, shows the event A and the complement of A. 


Fig. 7.5. Ais ©) 
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7.4, DEFINITIONS OF PROBABILITY 


Probability is something strange and it has been defined in different manners. 
We can define probability in objective or subjective manner. Let us first use the 
objective approach to define probability. 
7.4.1. THE CLASSICAL DEFINITION OF PROBABILITY 

This definition is for equally likely outcomes. If an experiment can produce N 
mutually exclusive and equally likely outcomes out of which n outcomes are 
favourable to the occurrence of event A, then the probability of A is denoted by P( A) 
and is defined as the ratio n/N. Thus the probability of A is given by 


P _ Number of outcomes favourabletoA — n 
(A) = Number of possible outcomes aa Soi 


This definition can be applied in a situation in which all possible outcomes and 
the outcomes in the event A can be counted. This definition is due to P.S. Laplace 
(1749 — 1827). The classical definition is also called the priori definition of 
probability. The word priori is from prior and is used because the definition is based 
on the previous knowledge that the outcomes are equally likely, When a coin is 
tossed, the probability of head is assumed to be 1/2. This probability of 1/2 is based 
on this classical definition of probability. It is assumed that the two faces of the coin 
are equally likely. In practical life the people do believe that a coin will do justice 
when it is tossed. In the playgrounds, the participating teams toss the coin to start 
the match. A coin in which probability of head is assumed to be equal to the 
probability of tail is called a true or uniform or an unbiased coin. But it is all an 
assumption. The probability of a certain event is a number which lies between 0 and 
1. If the event does not contain any outcome, it is called impossible event and its 
probability is zero. If the event is as big as the sample space, the probability of the 
event is 1 because 

Nut.ber of outcomes in the event N 


P( the event) = Total number of outcomes NN 


When probability of an event is 1, is called a 'sure' or 'certain', event. 

CRITICISM 
The classical definition of probability has always been criticised for the 

following reasons 

(i) This definition assumes that the outcomes are equally likely. The term equally 
likely is alinost as difficult »° the vord probability itself. Thus the definition 
uses the circular »casoning. 

Gi) The definition is not applicable when the outcomes are not equally likely. 

(iii) The definition is also not applicable when the total number of outcomes is 
infinite or it is difficult to count the total outcomes ur the outcomes favourable 


to the event. It is difficult to count the fish in the ocean. Thus it is difficult to 
find the probability of catching a fish of some weight say more than 1 kg. 
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Example 7.2. 

One day 20 files were presented to an income tax officer for disposal. Five files 
contained bogus entries. All the files were thoroughly mixed and there was no 
indication about bogus files. What is the probability that one file with bogus entries 
is selected. 

Solution: 

Here all possible outcomes = 20 

Let A be the event that the file has bogus entries. 

Thus, number of favourable outcomes = 5 

Here we shall apply the classical definition of probability. All the 20 files are 
assumed to be equally likely for the purpose of selecting a file. 

Probability of selecting a file with bogus entries is written as P( A ) 

Number of outcomes favourable toA _ n(A) 25 ee 1 

Thus, P(A) = Number of possible outcomes inS ~ n(S) 20 ~ 4 

Example 7.3, 


A class of 60 students has 30 first divisioners, 20 second divisioners and 10 
third divisioners. One student is selected at random. Find the probability that the 
selected student is (i) a first divisioner (ii) first or second divisioner Gii) second or 
third divisioner. 

Solution: 

There are sixty students in all. Thus there are 60 possible outcomes in S which 
are assumed to be equally likely. We shall apply the classical definition of 
probability to find the required probabilities 
(@ Let A be the event that the selected student is a first divisioner. There are 30 

outcomes favourable to the event A. 

Thus the probability of the event A denoted by P( A)is, 

P(A) = Number of outcomes favourable to A =~ mA) _ 30 _1 
Number of possible outcomes in S n(S) 60 2 
(ii) Let B be the event that the selected student is a first or second divisioner 
Number cf outcomes fayourable toB — n(B 50. 5 

Hence, P(B) = Number of possible outcomes inS ~ n(S) =60-6 
(ii) Let C be the event that the selected student is a second or third divisioner. 

Then the number of outcomes favourable to © is 30. 


= Number ofoutcomes favourabletoC _n(C) 30 1 
Hence, P(C) = Number of possible outcomesinS ~ n(S)~ 60° 2 


Example 7.4. 

A fair die is thrown. Find the probabilities that the face on the die is 

(@) maximum (ii) prime (iii) multiple of 3. (iv) multiple of 7. 
Solution: 

There are 6 possible outcomes when a dic is tossed. We assume that all the 6 
faces are equally likely. The classical definition of probability is to be applied here. 


270 


Basic Statistics Part-I 


The sample space is 


S = {1,2,3,4,5,6}, 
(i) Let A be the event that the face is maximum 


Thus, A = {6}, n(A) = 1, 


n(S) 


P(A) 


(ii) Let B be the event that the face is prime, then 


B = {2,3,5}, n(B) = 3, 


(iii) Let C be the event that the face is multiple of 3 


Hence, C = {3,6}, n(C) = 2, 
(iv) Let D be the event that face is multiple of 7. There is no multiple of 7 in the 


sample space. Thus D = 9, P(D) = 


Example 7.5. 


P(B) 


P(C) 


n(D) _ 
n(S) ~ 


6 

n(A) _ 1 
n(S) ~~ 6 
n(B)_ 3_1 
n(S)) 6.02 
MEG): tineZs 6) 1: 
n(S)" 6°38 


0 (not possible ) 


Two dice are rolled. Let A be the event that both faces are same, B be the event 
that total on the two dice is less than 5 and C be the event that there is at least one 
ace on the two dice. Write the sample space S and the outcomes which belong to A, 
B, C, AUB, ANB and find their probabilities. 


Solution: 
When two dice are rolled, there are 36 possible outcomes which are given in 
Table 7.1. 
The events (sub-sets) A,B,C, AUB and ACB are written as: 
A= {1}, @2, (3), (44), 6,5), (,6)}, n(A)=6 
B = {(1), (12, @D, (3), @ 1), (2,2)}, n(B)=6 
c= {4), (2 3), 24 G5, (6, (2,0, 
3D, 4D, 6D, 6D}, n(C)=11 
AUB = {(1,1), (2,2), (3), (4,4), (6,5), 6 6), (1,2), 
21), (1,3), (3,)}, n(AUB)=10 
ANB= {(1,)), (2,2)},n(AnB) = 2 
Hence P(A) = ay = £- z 
P(C) = BCC) uv 
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P(AUB) = Number of outcomesin AUB __ n(AVB) _ 10- 6 
AvuB) = Number of all possible outcomes ~ n(S) 2 3361 F 18 
P(ARB) = Number of outcomes in AB EW NCES hp i lA aa by 
(ANB) = Number of all possible outcomes ~ n(S) 367 —«d18 


Note: The events (A U B) and (A mB) are the events which belong to S and we can 
find their probabilities by using the classical definition of probability. 
Example 7.6. 

Suppose a fair die is rolled. Let A be the event that the face on the die is prime 
and B be the event that the face is even. Write the elements of the following events 
and find their probabilities. 


@A (i) B (ii) A (iv) 


Wi) AUB Wii) ANB Wii) AUB (ix) AUB 
Solution: 
When a die is rolled, the sample space is 
S = {1,2,3,4,5,6} 
As the die is fair, therefore all the 6 outcomes are equally likely. Each outcome 
has probability of 1/6. We shall apply the classical definition of probability to find 
the probabilities of the given events. 


Thus A = {2,3,5} B= {24,6} 
A= {1,46} B= {1,36 
ANB= {2} AUB= {2,3,4,5,6} 


ANB= S-AnB= {1,3,4,5,6} 
AUB= S-AUB= {1} 


AUB= S-AUB= {2,3,4,5,6}= AUB 
Hence) P(A) = Tisy = § = 3 G) PB) = ney 6.7 2 
Gi) PA) = ate $= 2 ) PB) = Citi 372 
1 0 
(v) P(ANB) = ai =§ (vi) P(AUB)= ee si = A 
a ala re ra pony. MAUE 
(vii) PCAN B) = in =a (vit) AUB = MES = ; 


(ix) P(AUB) = erie = 2 = Paup) 
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Example 7.7. 

A bag contains 5 white, 4 black, 3 green and 6 red balls of the same size and 
weight. A ball is selected at random, find the following probabilities. 

(@) The selected ball is white. (ii) The selected ball is white or black. 

(iii) The ball is not black. 
Solution: 

As all the balls are of the same size, the 18 possible outcomes are equally 
likely. A probability of 1/18 is attached to each ball. The classical definition of 
probability is applied here. 


Thus, (i) P(ball is white) = - (ii) P(ball is white or black) = Z = ; 
(iii) P(ball is not black) = # = q : 


Example 7.8. 
Two uniform coins are tossed. Find the following probabilities: 


(i) both are heads (ii) both faces are same (iii) only one is head 
(iv) at least one is head (v) head on the first coin, 
Solution: 


When two coins are tossed, there are four possible outcomes. Thus the sample 
space 18 
S = {(HH),(TH),(HT),(TT)}, n(S) = 4 
All the 4 outcomes are equally likely. Each outcome has probability of 1/4. The 
outcome (TH ) has tail T of the first coin and head H of the second coin. 
(i) Let A denote the event that both coins give heads, then 


yale 
A = {(HH)}a(A) = 1, PfA) = BE = 7 

(ii) Let B be the event that both faces are same, then 
phot 
B = {(HH),(TT)},n(B) = 2 P(B) = rile 25 

(iii) Let C denote the event that only one is head, then 
Cree 
C = {¢HT),(TH)}, mC) =. 2, PCC) = Stet = 9 = 3 


This event is also called "just one is head" or "exactly one is head". We can also 
say "only one is tail" or "just one is tail". All these statements mean the same 
event C. 

(iv) Let D denote the event that at least one is head. It means any number of 
heads, thus 


D = {(HH),(HT),(TH)}.(D) = 3, RD) = B= F 
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(vy) Let E stand for the event "head on the first coin". The event E means that 
anything may come on the second coin but it should be head on the first coin, 


Thus 
E = {(HH),(HT)},n(E) = 2, P(E) = Sie = 2 2 3 
Example 7,9 


Acard is selected from a deck of playing cards. Find the following probabilities: 
(i) thecardisred (ii) the card is a king (iii) the card is of diamonds 
(iv) the card is a king or queen (v) it is a faced card, 

Solution: 


A deck of playing cards has 62 cards which are of the same size. When a card is 
selected, there are 52 possible outcomes which are assumed to be equally likely, 
Thus the classical definition of probability is applied here to find the probabilities of 
the events belonging to this sample space. 


Here, total number of possible outcomes = n(S)= 52 
(i) The deck has 26 red cards, therefore 
B( card ie pape Number of red cards ae one 
Total number of cards 52 2 


(ii) There are 4 kings in the playing cards, therefore 
Number of kings 4 ah 


Total number ofcards ~ §2 ~ 13 


(iii) There are 13 cards of diamonds, thus the event contains 13 points out of 52 
points all of which are equally likely. Therefore 


P(cardisaking) = 


{ i Pe age | 
P( card is ofdiamonds) = es 4 
(iv) There are 4 kings and 4 queens, thus the event contains 8 points. Hence, 


8 2 
P( card is a king or queen ) = 52> 13 


(vy) The jacks, queens and kings are called faced or pictured cards. Thus there are 
12 faced cards. Hence, 


_ P(cardis faced) = 527 13 


Example 7,10. 

A digit is selected at random from the first ten natural numbers. Find the 
probability that the selected digit is (i) greater than 6 (ii) a complete square 
(iii) multiple of 3 (iv) a prime number less than 3. 

Solution: 

There are 10 possible outcomes in the sample space S. It is assumed here that 

all the 10 outcomes of the sample space are equally likely. Each digit has probability 
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of selection equal to 1/10. Thus we shall apply here the classical definition of 
probability. 


S = {1, 2,3, 4,5, 6, 7,8, 9, 10} 


Total number of possible outcomes = 10= n(S) 
(i) Subset of digits greater than 6 = { 7, 8,9, 10} 
ang Number ofdigitsinthe subset _4 2 
P( digit is greater than6) = “total number of digits  ~ 10 ~ 5 
(ii) Subset of digits which are a complete square = { 1,4, 9} 
eaies _ _Number of favourable outcomes. _ 3 
P( digit is a complete square ) = Total number of possible outcomes ~—:10 
(iii) Subset of multiples of 3 = { 3, 6,9 } 
Number of favourable outcomes__ 3 


P( digit is multiple of 3) * Total number of possible outcomes 10 


(iv) Subset of a prime number less than 3 = {2} 
P( digit is a prime number less than 3 ) 


Number of favourable outcomes _ 1 
Total number of possible outcomes ~ 10 


7.4.2 THE RELATIVE FREQUENCY DEFINITION 


This definition is based on the actual observations when the random 
experiment is repeated a large number of times under similar conditions. Suppose a 
random experiment is repeated N times and it is observed that a certain event say A 
has actually occurred n times, then the probability of the event A is defined as the 
ratio n/N where N is very large. 


3 Limit n 
In symbols, we write P(A) = en 

This definition is also called the empirical or the statistical definition because 
we observe the occurrence of the event when the experiment is being repeated. It is 
assumed in this definition that the ratio n/N will become a stable value when N is 
repeated a large number of times. This stable value of the ratio is called the 
probability of A and is written as P( A). 

This definition is closer to the real world situations. If we are interested to 
know whether or not a coin is true, we can toss a coin a large number of times and 
the number of heads can be counted. The probability of head is given by 

P(head) = Number of heads actually observed 
CI) Total number of throws 

This ratio may or may not be equal to 1/2. It is not possible to find this ratio by 
classical definition. Suppose we take a sample of 100 bulbs from a certain factory 
and examine them. We find that 20 are defective. The ratio 20/100 is called the 
proportion of defective bulbs. This ratio is 0.2. It is also called the probability of 
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selecting a defective bulb provided sample size is increased and the ratio reaches 

some stable value. This definition can be applied in real world problems for finding 

the probabilities. But this definition also has certain deficiencies such as zi 

() The experiment is to be repeated a large number of times under the same 
conditions, It is difficult to maintain exactly the same conditions for lengthy 
experiments. When a coin is tossed a large number of times, the coin may 
reduce in size. The surface on which the experiment is done may not remain 
the same, 

(ii) Repeating the experiment a very large number of times is a very time 
consuming and sometimes very costly affair. Suppose we want to find the 
probability that a bullet or a missile will fire. It is not possible to repeat the 
experiment an infinite number of times because a lot of expenditure is involved 
on this experiment. 

Example 7,11. 

Mr. A has played 10 games with Mr. B in the past. Six games were won by Mr. 

A. They have decided to play a game. What is the probability that Mr. A will win the 

game. 

Solution: , 

An experiment has been conducted in the past and it has been observed that 

the event ( Mr. A wins ) has occurred 6 times. The experiment has been repeated 10 

times and the event has occurred 6 times. We apply here the relative frequency 

definition of probability to find the probability that the game they are going to play 

will be won by Mr. A. 

Thus, P(Awillwin the game)= 6/10 = 0.6 

Example 7.12. 

A coin has been tossed different number of times and number of heads have 
been recorded. The results of the experiment are given below. What do you think is 
the probability of getting head on the coin. 


Number of throws Number of heads 
50 30 
100 70 
200 125 
300 180 
5000 2560 
10000 5100 
20000 10202 
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Solution: 


The random experiment has been conducted to find the probability of head on a 
coin. We find the relative frequencies of heads in the following table. 


Relative frequency 


Number of throws Number of heads f 
(N) (f) ofheads (£) 


According to the relative frequency definition of probability, the ratio f/N is 
called the probability of the event provided the ratio f/ N reaches some stable value. 
In this experiment we see that the ratio f/ N becomes stable at 0.51 when the coin 
is tossed more than 5000 times. Thus we can say that the probability of head is 0.51 
for the coin with which the experiment has been conducted. When the probability of 
head is not 1/2, the coin is called biased. 

7.4.8, THE AXIOMATIC DEFINITION OF PROBABILITY 

This definition is based on certain axioms which must be satisfied by the 
probability of an event. Suppose a sample space S has the sample points Ey, Eg, +, 
E,, ..., E,. Each point is assigned a real number like PE)), P(E,), -., P(E), »», P,) 
which are called the weights or probabilities of the points. Probability of any point 
denoted by P(E,) must obey the following axioms : 

@ OsP@)s1 

The probability of the event must lie between 0 and 1. 

Gi) P(S)= 1 for thé event which is as large as the sample space. 
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(ii) IfA and B are mutually exclusive events, then 

P(AorB) = P(AUB) = P(A)+ P(B) 

According to this definition, a non-negative number is assigned to each point in 
S such that sum of all such numbers is 1. In this definition the condition of equally 
likely outcomes is not necessary. The outcomes may or may not be equally likely, 
When they are equally likely, each point in S will have the same probability. 
Probability of an Event A 


If an event A belongs to the sample space S, then the probability of A denoted 
by P(A) is the sum of the probabilities of all sample points B, included in A. 


ThusP(A) = ZP(E,) 

It is possible that all E, are equally likely. In this case PC E,) = P( E,) = +» 
= P(E). Each probability is 1/n because these are n equally likely points. 

IfE,, E., E, are included in the event A, then 


1 ik 1 

P(A) = P(E,)+P(E,)+P(E,) = Spot peatigg 
- 3 _ Number of sample points in A 
~ mn ~ Number of sample points in § 


Example 7,13. 

A coin has been designed in such a manner that the head has been assigned a 
weight of 3w and tail has been assigned a weight of 2w. Find the probability of head. 
Solution: 

According to the axiomatic definition of probability, the total weight of all 
possible outcomes must be 1. We can write 


Total weight = 8w+2w = 1, bw= 1 or w = 1/5 
Hence P(head)= 8w = 3x 1/5= 3/5= 0.6 
and P(tail) = 2w = 2*1/5= 2%5= 04 
We can say that the coin is not true. 
Example 7,14, 


Six faces of an unbalanced die are assigned weights as below. Find probability 
of each face, : 
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Solution: 
We shall apply here the axiomatic approach to probability. The total weight of 
all possible outcomes must be 1. Thus 
Total weight = w+2w+w+2w+wt2w = 1 or 9w =1 and w = 1/9 
Thus the six faces of the die have the probabilities as below: 


Mr a ea ne Si 
| 29 | vo | 2 | wo | 20 | 


The probability of each odd face is 1/9 and the probability of each even face is 2/9. 
Note: According to the classical definition of probability, the probability of head on a 
coin is always 1/2, In Example 7.12, the probability of head has been calculated by 
relative frequency definition and it is 0.51 for a certain coin. This calculation is for 
the particular coin which has been tossed 20000 times. In Example 7.13, the 
probability of head has been calculated by axiomatic approach and it is 0.6 for a 
certain coin. 

7.4.4, SUBJECTIVE OR PERSONALISTIC PROBABILITY 

A person may have some confidence or belief regarding the occurrence of some 
event, say A. The numerical measure of this confidence is called the subjective 
probability of the occurrence of A. This probability is based on the experience, 
intelligence and knowledge of the person who is determining the probability in some 
situation. For example, we may be interested to know whether a certain political 
system will succeed in a country or not. The probability of success in this situation 
cannot be determined by objective definitions of probability. The assessment of this 
probability is made by some expert. This approach can be applied in real world 
situations. This probability is subjective in nature. Different persons may have 
different probabilities for the same situation at the same time. 
7.5. ADDITION LAW FOR MUTUALLY EXCLUSIVE EVENTS 
Theorem: 

If the two events A and B are mutually exclusive then the probability that any 
one of them will occur is the sum of the probabilities of A and B. In symbols, 

P(AorB) = P(AUB) = P(A)+P(B) 


Proof: 

Suppose there is a sample 
space § containing N_ points 
(outcomes) which are equally 
likely. The two events A and B 
belong to S and contain n, and n, 
points respectively. There is no 
point common to A and B. As A 
and B are disjoint, the union of A 
and B contains (n, + n,) points. Renowenaded 
Using classical definition of Fig.7.7. 
probability, we can write 


N point. 
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Number of outcomes favourable to AU B 


P(AUB) Total number of outcomes in S 
+n2 
= Bye Rte = PCA)+R(B) 
Thus P(AUB) = P(A)+P(B) 


The law can be generalised for more than two events. If Ay, Ag -, Ay are'k 
inutually exclusive events then 


P(A,UA,U 4. UA) = P(A,) + P(Ag) te + PC AY) = EPA 


‘Asn special case, ifA,UA,U..UA, = § 
then P(A,UA,U...UA,) = P(S)= 1 
P(A,)+P(A,)+..+P(A) = 1 ‘ 
Here Aj, A, «.., A, are called exhaustive events. They are mutually exclusive 


and they cover the whole sample space. 
Corollary 1: If is an empty set, then P(d) = 0 
Weknow $US = S and P(6US)= P(S) 
$ and S are mutually exclusive events, therefore 
P(¢)+P(S)= P(S) or P(b) = P(S)-P(S)= 0 
Corollary 2: If Ais complement of a set A relative to same sample space S, then 
AVA=S 
and P(AUA)= P(S) = 1 A A 


Aand A are mutually exclusive, therefore 


P(A)+P(A)=1 or P(A) = 1-P(A) 

This is called law of complementation of two sets. 
Example 7.15. 

A coin is tossed 6 times. Find the probability that there is at least one head. 
Solution: 

The sample space S has 2° = 64 possible outcomes. There is only one outcome 
( TTTTTT ) in S which does not have any head. All the other 63 outcomes have 
either one or more heads. It is difficult to write all the 63 outcomes. Let A be the 


event "at least one head" and A be the event "no head". 
Aand A are mutually exclusive and exhaustive events, therefore 
AVA= § and P(AUA) = P(S) 


P(A)+P(A)= 1 or P(A) = 1-P(A) = 1-1/64 = 63/64 
Hence, P( at least one head) = 63/64 
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Example 7.16. 

A card is selected at random from a deck of playing cards. Find the probability 
that the card is a king or a queen. 
Solution: 

There are 52 possible outcomes which are equally likely. Each card has 
probability of 1/52. 

Let A represent the event "card is a king" and B represent the event "card is a 
queen". There are 4 kings and 4 queens in a deck. The events A and B are mutually 
exclusive, therefore 3 


P(AUB) = P(A)+P(B) = in + és = & 

We have already solved this question in Example 7.9 but here we have used a 
law called the addition law of disjoint events. 
Lxample 7.17. 

A die is rolled. Find the following probabilities. 

() The face is multiple of 3 or multiple of 5. 

(i) The fave is a complete square or it is the maximum face. 
Solution: 


When a die is rolled, the sample space, is S = {1,2,3,4,5, 6} 
@) Let A = {3,6} B = {5} 
Aand B are mutually exclusive events, therefore 
P(AUB) = P(A)+P(B)= 5 +2 = 222 
Alternately" 
A = {3,6} B= {5} AUB= {3,5,6} 


From the classical definition of probability of an event, we have 


P(AUB)= Number of outcomes in AUB n(AUB)_ 3 1 

(AUB)= Total number of possible outcomes Mow CET 2 
Gi) Let A = {1,4} B= {6} 

A and B are mutually exclusive events, therefore 
9 

PAUB) = P(A)+P(B)= § +2=2 = 2 

Alternately 
A= {1,4} B= {6} AUB= {1,46} 


P(AUB)= PUPS). 8 1 
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When the events A and B contain small number of points, we can list the points 
in A, B and A uv B and we can find the probability of the event A U B by using the 
definition of probability ie., 

‘ 
_ n(AUB) 
P(AUB) = “sy 

But when it is difficult to list the clements in A U B, we would use the addition 
law of probability for mutually exclusive events. 
Example 7.18. 

Two dice are rolled. Find the probability that the total is less than 12. 
Solution: 

When two dice are rolled, there are 36 possible points. In the sample space 
there are 35 ( please see Table 7.1 ) pairs in which the total is less than 12, Let A 


represent these 35 outcomes and A represents only one outcome ( the total of 12) 
P(A) = 1/36 
As A and A are disjoint, Therefore P(AVA) = PR(S) = 1 


P(A)+P(A)= 1 or P(A) = 1-P(A) = 1- 4/36 = 35/36 
Example 7.19. 


If A and B are two mutually exclusive events from a sample space, then is it 
possible that P(A) =0.7 and P(B)=0.6? 


Solution: 
As A and B are mutually exclusive, therefore 4 
AUBs § s 
If A and B are exhaustive events, “ E 
thnAUB= § 
If A and B are not exhaustive events, AUB=8 
A and B are exhaustive events 
Fig.7.9. 
thnAUB< § 
ThusAUBs § " Hl 
and P(AUB)s P(S) 
P(A)+P(B)s 1 
0.7 + 0.6 £1 AUB<S 
A and B are not exhaustive events 
Fig.7.10. 


Hence A and B are not mutually exclusive events. The given probabilities are 
wrong. 
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Example 7.20. 

A digit is selected at random from the first 100 natural numbers. Find the 
probability that the selected digit is multiple of 10 or multiple of 11. 
Solution: : 

Let A be the event "digit is multiple of 10" and B be the event "digit is multiple 
of 11", 

Then, n(S)= 100 

n(A) = 100/10= 10 points (10, 20, 30, 40, 50, 60, 70, 80, 90, 100) 
n(B)= 100/11= 9 points (11, 22, 33, 44, 55, 66, 77, 88, 99) 

Any multiple of 10 is not common with the multiples of 11. Thus A and B are 

mutually exclusive, therefore 


P(AUB) = P(A)+P(B) = yoy + Gog = GS 
Example 7.21 ‘ 
A, B and C are taking part in a race. The chance of winning the race by A is 
half of that of B and B winning the race is half of that of C. Find their 
respective chances of winning the race. 
Solution: 
Let the probability of winning the race by A is p ie; 
P(A) =p, then P(B) = 2p and P(C) = 4p. 
Since A, B and C are mutually exclusive events, therefore 
PAUBUC)= P(A)+PB)+P(C) 
1 = p+2p+4p or 7p= 1 or p = 1/7 
Hence P(A) = 1/7, P(B) = 2/7 and P(C) = 4/7 
7.6 ADDITION LAW FOR NOT MUTUALLY EXCLUSIVE EVENTS 
Theorem: : 
If A and B are not mutually exclusive events then the probability that A or B or 
both occur is given by 
P( Aor Bor both) = P(AUB) = P(A) +P(B)—-P(AUB) 
Proof: 
Suppose there is a sample A(n, points) —_ B (n; points) 
space § containing N points 
which are equally likely. The 
two events A and B belong to S 
and contain n, and n, points 
respectively. The events A and 
B overlap each other with the 
cesult that A B has m points. 
Thus m points are commen to A 
and B. AU Bean be written as AB has m points 
union of two disjoint events Au Bis shaded 
which are Fig.7.11. 


s 


N points 
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@ A (i) (B-ANB) 
Thus we can write AUB= Au(B-AnB) 
A contains n, points and (B—A/B) contains (n,—m ) points 
Thus AM B has n, +(n,—m) points 
Using classical definition of probability, we have 


ee Number of points in AUB _ mitne=m _ ni | ne 
P(AUB) = Number of points in S$ N NtN 
P(AUB)= P(A)+P(B)-P(ANB) 

Which is called the general law of addition of probabilities. 

When A and B are mutually exclusive, then AW B = > and PCANB) = 0, 
we get PC AUB) = P(A) + P(B) which is the addition law for mutually exclusive 
events. For three not mutually exclusive events, 

PAUBUC) = P(A)+P(B)+P(C)—-P(ANB)-P(ANC) -P(BNC) 
+P(ANBNC) 

When A, Band © are mutually 
exclusive, then 

P(ANB) = P(ANC) 


= P(BNC) Ss 
= PQANBNC) 
= 50) 
ANBnC=6 
A, B, C are mutually exclusive 
Fig.7.12. 


Therefore, PCAN BOC) = P(A) +P(B)+P(C) 
This is called addition law of probability for mutually exclusive events. 
Example 7.22. 
A die is tossed. Find the probability that the face is a prime or is even number. 
Solution: 
When a die is tossed, there are 6 possible outcomes which are equally likely. 
Thus 
S = {1,2,3,4,5,6} and n(S) = 6 
Let A represent the event "face is prime" and B represent the event "face is 
even" then 
Ars {2 S60) Baay Gord oh AaB {2} 
A and B are overlapping. Using the addition law for not mutually exclusive events 


wehave, P(AUB) = P(A)+P(B)-P(ANB) = 3 + 2 - a2. 
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Alternate Solution 
We have already solved this question in Example 7.6. 


Wehave S = {1,2,3,45,6} A = {2,35} B= {2,46} 
A 
AUB= {2,3,4,5,6} P(AUB)= —- é 


Example 7.23. 


Two coins are tossed. Find the probability that both faces are heads or at least: 
one is head. 


Solution: 

When two coins are tossed, the 
sample space S contains 2? = 4 possible 
outcomes. Let A represent the event 
"both are heads" and B represent the 
event "at least one is head". Thus 


A = {(HH)} 
BS { (HH), (HT), (TH)} AU Bis shaded 
ANB= {(HH)} Fig.7.13. 
is Bak Oh 3: 
P(AUB) = P(A) +P(B)—P(ANB) See nate a: 


Which is the required probability. 


Alternately 
Wehave S = {(HH),(HT),(TH),(TT)} A ={(HH)} 
B = {(HH),(HT), (TH)} AUB ={(HH), (HT), (TH)} 
A 3 
P(AUB)= se =e. 


Example 7.24. 

The probability is 0.6 that Mr. A will pass the examination and the probability 
is 0.8 that Mr. B will pass the examination. Both sit in the examination. Find the 
probability that somebody will pass the examination. 

Solution: 

Both A and B can pass the examination. Therefore, the events A and B are not 

mutually exclusive. 
Hence, P(AUB)= P(A)+P(B)-P(ANB) = 0.6+0.8—(0.6) (0.8) 
= 0.6+08-048 = 0.92 
Note: We have assume* ... A and B are independent events i.e., 
P(A \PY= P(A) P(B) 
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Example 7.25, 

A digit is selected at random from the first 20 digits. Let A be the event that 
the digit is multiple of 4 and B be the event that the digit is multiple of 6. Find the 
probability i) ANB (ii) AUB. 


Solution: 
Here, S = {1,2,3,..,20}, (S$) = 20 
A = {4,8,'12, 16,20}, n(A) = 5 
B = {6,12, 18}, n(B) = 
ANB= {12}, n(AnB)= 1 
pay Os) Rae = WCB). 8) 
BA) = 3(8) = 20 ECE Syl aeao 
P(ANB)= on =o 
Thus, 


P(AUB)= P(A)+P(B)—-P(ANB) 20 * 30 ~ 20 = 30 


Alternative Method 


S = {1,2,3,..., 20} 
A = {4,8,12,16,20}, B = {6,12 18}, 
AUB = {4,6,8, 12, 16, 18, 20} 


According to classical definition of probability 


P(AUB)= Number of points in AUB aha 


Number of points in S 20 


Example 7.26 
The following contingency table is set up: 


Determine the ee of: 


(i) event A (ii) event B (iii) event A 
(iv) event A and B (vy) event AandB (vi) event A and B 
vii) event AorB (viii) event A or B (ix) event AorB 


(x) vert A or B 
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tai Dorer mbred iy rok 


Solution: 


A) _ 40 é hy ee 
@ Pay =e = Soden Ray BS SE = oss 

A _n(AnB)_10 _ 
(ii) P(A) = ae = - 060 Gv) P(A and B) = "ET = 05 = 0.10 
(%) P(A and B) = 9 AGB) -39-<9.50 | (7) P(A and B) = BACB) - 35 - 0.95 


n(S) ~ 100 
4035 10 65 | 
100 * 100 100 = 100 = 9-65 
40.65 30 75 
100* 100 100 ~ 100 ~%75 


(vii) P(A or B) 


P(A) + P(B)-P(AMB) = 


(viii) P(A or B) P(A) + P(B)-P(ANB) = 


(ix) P(A or B) P(A)+P(B)-P(A 4B) = 


x x 60 35 25 70 
P(A) + P(B)- P(A AB) = Yoo * 100~ 100 = 100 = 


(x) P(A or B) 0.70 


Example 7.27 
A and B can solve 70 % and 80 % of the problems in a book respectively. Find 
the chance that a problem chosen at random will be solved by at least one of 
them. 

Solution: 


P(A) = = 0.7, P(B) = 2 = 0.8 and P(A q B) = P(A) P(B) = 0.7 x 0.8 = 0.56 


(: A and B are independent) 


Since A and B are not mutually exclusive events, therefore 
P(AUB) = P(A)+P()-P(ANB) = 0.7+ 0.8-0.56 = 0.94 

Example 7,28 
A class contains 10 boys and 20 girls. Half of the boys and half of the girls have 
brown eyes. Find the probability that a student chosen at random is a boy or 
has brown eyes. 

Solution: 


n(S) = 3; = 30 
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Let A = astudentsisaboy and B = astudent has brown eyes 


n(A) = C, = 10 mays Me Bd 

n(B) = 1C) = 15 yp) = A Bed 

n(ANB)= 8Ci = 5 Wane) = =-5-i 
Since A and B are not mutually exclusive events, therefore 
P(AUB) = P(A)+P(B)-P(AnB) = 3 +4 -4= aeS-1_4_2 


Example 7.29 
A drum contains 50 bolts and 150 nuts. Half of the bolts and half of the nuts 
are rusted. (i) If one item is chosen at random, what is the probability that it is 
rusted or is a bolt? (ii) If two items are chosen at random, what is the 
probability that both are rusted or both are nuts? 
Solution: 
Total items = 50 bolts + 150 nuts 
Rusteditems = 25 bolts + 75 nuts 
@ — n(S) = 200C, = 200 
Let A be the event that an item is rusted 
and B be the event that an item is a bolt 


200 
100 


Won 


00 r 

n(A)= 1, = 100 P(A) = eek ae 
i 

4 


B 50 
n(B)= 5C; = 50 P(B) = ae gan = 
n(ANB) = 20, = 25 P(AnB)= me = =} 
Since A and B are not mutually exclusive events, therefore 
P(AorB) = P(A)+P(B)-P(ANB) = 5 +4-2= 2 = 0625 


(ii) n(S) = 200C, = 19900 
Let C be the event that both items are rusted 
and D be the event that both items are nuts 


: 
n(C)= 1G, = 4950 P(C) = ela 2 ee 0.25 
n(D)= 150G, = 11175 P(D) = — = te = 0.56 


= 8G, = = MCCAD) _ 2775 _ 
n(CnD) = C2 2775 P(C AD) = n(S) = 19900 =0.14 
Since C and D are not mutually exclusive events, therefore 
P(CorD) = P(C)+P(D) -P(CAD) = 0.25 + 0.56-0.14 = 0.67 
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7.7, CONDITIONAL PROBABILITY 

The probability of an event A is called conditional if it depends upon the 
occurrence of some other event B, The conditional probability of event A when event 
B has already occurred is denoted by P(A/B). It can be shown that 


P(A/B) = PROB) [%(B) #0] 
We read P( A/B) as probability of A when B has occurred or "the probability of 
A given B". 
Explanation 


The event B has already occurred is called "the additional information" about 
the sample space S. The additional information reduces the size of the sample space 
S and the remaining part of S is called the reduced sample space which may be 
denoted by S,. Any probability calculated from the reduced sample space is called 
conditional probability. When the event B has occurred, then the reduced sample 
space consists of B. Thus S, = B. When it is known that B has occurred, then we are 
concerned with reduced sample space and not with the original sample space. 
Suppose a die is rolled and the additional information is that the face is even. Now 
we are interested to find the probability that the face is a complete square. When it 
is known that the face is even then the sample space S= ‘i 1, 2, 3, 4, 5, 6} is reduced 


to { 2, 4,6 }. Now we are concerned with this reduced sample space. In the original 
sample space each face has probability of 1/6. Thus in the reduced sample space 
each face has.probability of 1/3. Face 4 is a complete square in the reduced sample 
space. Thus probability of a complete square when it is known that face is even is 
1/3. It is called conditional probability of a complete square when the face is even. If 
B denotes the even face and A denotes the "complete square" then P(A/B) = 1/3. 


We can show that the conditional probability 


P 
P(A/B) = meee P(B)+@ 
B 
Ss 
n points 
N points 
AB has m points 
Fig. 7.6. 


Suppose there is a sample space S which contains N outcomes which are 
equally likely. The event B has n points and there are m points which are common to 
the event A and B. Thus A - B contains m points. When it is known that B has 


aan awe) 
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occurred, then occurrence of A is possible only when any point out of m points 
occurs. By using classical definition of probability, we have 
P(A/B) = ee (As B has occurred, therefore B becomes sample space for A) 


When both numerator and denominator are divided by N, we get 


/ 
P(A/B) = @ = aN = ae [P(B) #0] 


when P(B)= 0, then P(A/B) does not exist, 


When ANB 4, P(ANB)= 0, therefore P(A/B) = 0 
Similarly, P(B/A) = ae [P(A) #0] 


When ANB= 4, P(ANB)= 0 and P(B/A) = 0 


Thus any conditional probability will be greater than zero only when AW B¥ > 
which means that A and B are not mutually exclusive events. 
Example 7.30. 


A fair die is rolled. Find the probability that the face is even given that the face 
is less than 4. 


Solution: 


We have to calculate the conditional probability that the face is even when it is 


known that the face is less than 4. We can find this conditional probability by two 
different methods. 


MethodI: By Sample Space Approach 
When a die is tossed, the sample space of 6 equally likely outcomes is, 
S = {1,2,3,4,5,6} 
Let A denote the event "face is even" and B denote the event "face is less than 4" 
Then A = {246} B= {1,2,3} and AnB= {2} 
From the classical definition of probability, we have, 


PR(B) =  s g P(AMB) = ae). ‘ 
P(ANB) 16 1 
P(A/B) = P(B) =~ 36 = 3 


Which is the required conditional probability. 
Method II: By Reduced Sample Space 
When a die is tossed, the sample space is 
S = {1,2,3,4,5,6} 
When it is given that the face is less than 4, then the reduced sample space is 
8: = {12,3}, n(S:) =3 


Let A= Face is even, n(A)=1, P(A) = et = 


wl 
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Example 7.31. 

The workers in » factory are divided into 4 groups as shown below. A worker is 
selected at random, find the probability that the worker is skilled when it is given 
that the worker is a male. 


Solution: 
Let A denote the event "worker is skilled" and B denote the event "worker is a 
male", then A contains 110 workers and B contains 120 workers. 


There are 80 workers who are males and skilled. Thus A 14 B contains 80 
workers. 7 
Number of workers in B 120 80 


ents E(B) = ‘Total number of workers. 1602 224 PCAOB) = 769 
PCAOB) _ 80/160 go 2 


Hence, P(A/B) = 


Alternate Solution 


The conditional probability P( A/B) can be very easily calculated from the 
idea of reduced sample space. When it is given that the worker is male then the 
reduced sample space is 


Unskilled 


Example 7.32. 

Two dice are rolled. Find the probability that the total on the two dice is less 
than 5 given that both the faces of dice are similar. 
Solution: 

When two dice are rolled, there are 36 possible outcomes as given in Table 7.1. 
All the 36 outcomes are assumed to be equally likely, each having probability of 
1/36. 


P(B) ~ 120/160 ~ 120~ 3 


In the reduced sample space, 80 workers are 
skilled. Thus the required conditional probability is 
80/120 = 2/3 which is same as calculated earlier. 
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Let A be the event "total on the two dice is less than 5" and B be the event 
"both faces are similar". 


Then A = {(1, 1) (1,2), @ 1), (1,3), @, 1), @,2)}, nA) 
B= {(,}), 2,2, @,3), (4,4), 6,5), 6, 6)}, n(B) 

and ANB= {(1,1), (2,2)}, n(AnB) = 2 
1 
Thus P(B) = =o = oe $ 
DCE Benge, 
PAOB) Secancg nage rre 
Hence, P(A/B) = KUN odd) aged af 


Example 7.33, 

Given S = {1,2,3, 4,5, 6, 7, 8} 

If A = {1,2,7} and B = { 2,3, 5,6} 

Find @) P(A/B) (ii) P(B/A) (iii) P(B/A) (iv) P(A/B) 

What relation do you find between P(A) and P(A/B) and P(B ) and P(B/ A), 
Solution: 


Given S = {1,2,3,4,5,6,7,8} _ 
Ay = 3172.7}, 9) B.=5{0\375..6) 


Then A = S-A = {3,4,5,6,8}, AnB= {2} 
AB = {3,5,6} 
Hence, P(A) 


" 
Ie col 
= 
les} 
LS 
i 
aE 
v 
zx 
> 
L 
i 
olen 


P(ANB)= 3, P(AMB)= : 
AB 8 1 
@) P(A/B) = PSB) =e =H 
Hence, P(A/B) < P(A) 


$ PBOA)_ P(ANB) 18 _ 1 
fo ECB A i= seh yaaeea ne dae Baia 


Hence, P(B/A) < P(B) 
Gi) PBA) = PAB) _ 38 _ 3 
P(A) 
Hence, P(B/A) > P(B) 
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Gv) p(A/B)= ACB). 28 es 


Hence, P(A/B) > P(A) 

In (i) and (ii) above, we find that both the conditional probabilities P( A / B ) 
and P( B/ A) are less than the ordinary probabilities and in (iii) and (iv) above, both 
conditional probabilities are greater than the respective ordinary probabilities. 

Thus if A and B are two events then both conditional probabilities P( A / B ) 
and P( B/ A) will be greater or both will be less than their ordinary probabilities or 
both will be equal to their respective ordinary probabilities 

If P(A/B)= P(A), then P(B/A) = P(B) 

The proof of this fact is beyond the scope of this book. 

In a topic on Independence we shall say that the events A and B are 
independent if 

P(A/B) 

If P(A/B) 
Example 7.34. 

Given S = {1, 2,3, 4,5, 6, 7,8, 9,10} A= {1,3,8,9, 10} B={1,3,6,7} 

write the elements of the events A B and find the probabilities 

G@ P(A)and F{A/B) (i) P(B) and P(B/A) 


P(A) and P(B/A) = P(B) 
P(A), then P(B/A) is always equal to P( B). 


“ow 


Solution: 
Given S={1, 2, 3, 4,5, 6,7,8,9,10} A= {1,3,8,9,10} B= {1,3,6,7} 
We have 
AnB= {1,3} and PAnB = % 
ee AGB) 2G) 2) 07 
@ MA)=th= de, PAIR) egy gk ee 
Thus P(A) = P(A/B) 
4 ae P(BAA) 210 2 
ay Peppa =%, pepsay = BOM). Te We 


Hence, P(B) = P(B/A). Here, P(A/B)=P(A) and P(B/A)=P(B). 
This is a condition of independence which will be discussed later. 
Example 7.35. 
Two coins are tossed. Find the following probabilities. 
(i) both are heads given that at least one is head. 
(ii) the first coin shows head when it is known that one is head and the other 
is tail. 
Solution: 
When two coins are tossed the sample space of 4 possible outcomes is 


S = {(HH),(HT),(TH),(TT)}, n(S) = 
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It is assumed that the four outcomes are equally likely and each outcome has 
probability of 1/4, 


@ Let A denote the event "both are heads" and B denote the event "at least one is 
head" then 
A = {(HH)}, n(A)= 1 
B = {(HH),(HT),(TH)}, n(B) = 3 
ANB= {(HH)}, n(AnB) =1 
B 3 ANB 1 
p(B)= 2) = fond AmB) = MAG) _ 1 
P(ANB 1/4 1 
Hence, P(A/B) = be 4 sa =i4 1 
Alternately = 
When it is given that at least one is head, then the reduced sample space is 
S:= {(HH), (HT), (TH)}, n(Sr) = 3 
ye Ls =-mMA) _1 
Let A= Two heads, n(A) = 1, P(A) = nS) 8 
(ii) Let A denote the event "the first coin shows head" and B denote the event "one 
is head and the other is tail" then 


A = {(HH),(HT)},n(A) = 2 

B = {HD, (TH}, n®) = 2 

ANB= {(HT)}, n(AQB)= 1 
Thus, P(B)= rest = 2 and P(ANB) = 2 at =a 


Hence, P(A/B) = PLAOB) oar = 4 =} 
Alternately 
When it is known that one is head and the other is tail, then the sample space 
Sis reduced to {(HT), (TH)}, n(S-) = 2 
Let A= First coin shows head, n(A) = 1, P(A) = BAI = 4 
In the reduced sample space one outcome is favourable to the event A. Thus the 
required conditional probability is 1/2. 
Example 7.36. 
Three babies are born in a family. Find the following probabilities: 
(i) a family has 3 male babies given that all the 3 babies have the same gender. 
(ii) a family has 2 male babies when it is known that the first baby is male. 
(iii) a family has 2 girls given that the middle baby is a male. 


(iv) a family has 2 male babies given that the first and last babies have the same 
gender. 
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Solution: : 

It is assumed here that probability of male birth is equal to that of female 
birth. There are 2°=8 possible outcomes in the sample space S. Each outcome has 
probability of 1/8. A boy is denoted by B and a girl is denoted by G. The sample 
space is 


S = {( BBB), (GBB), (BGB), (GGB), (BBG), (GBG), (BGG), (GGG) } 
n($) =8 

(i) Let A denote the event "3 male babies" and B denote the event "3 babies have 
the same gender". then 


A = {(BBB)}, n(A) = 1 
B = {(BBB),(GGG)}, n(B) = 2 


ANB= {(BBB)}, n(ANB)= 1 
B : 
P(B) = oe ‘ and PAnB) = MAO a2 


Hence, P(A/B) = naee = 1 


which is the required conditional probability. 


Gi) Let A denote the event "2 male babies" and B denote the event "first baby is 
male" then 


A = {(GBB),(BGB),(BBG)}, n(A) = 3 
B= {(BBB),(BGB),(BBG),(BGG)}, n(B) = 4 
AnB= {(BGB),(BBG)}, n(AnB)=2 

PCB) eee ae P(ANB)= Menb) . 2 


n(S)~ 8 n(S) 
Hence, P(A/B) = PACB) ar ial 


(aii) Let A denote the event "2 girls" and B denote the event "middle baby is a male" 
then A = {(GGB),(GBG),(BGG)}, n(A) = 3 
B = {(BBB), (GBB ), (BBG), (GBG)}, n(B) = 4 
AQB= {(GBG)}, n(ANB)= 1 


P(B) = Se § and P(ANB)= Bere 


Thus’ P(A/B) = es = 8 


" 


1 
4 
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(iv) Let A denote the event "2 male babies" and B denote the event "first and last 
babies have the same gender" then 


A = {(GBB),(BGB),(BBG)}, n(A) =3 
B = {(BBB),(BGB),(GBG),(GGG)}, n(B) = 4 
ANB= {(BGB)}, n(AnB) = 1 


B 4 AONB 1 
ME) = ay = 3 and Rann je Moe =3 
Therefore, P(A/B) = ay es = 18 1 


7.8. INDEPENDENCE OF EVENTS 
The two events A and B are called statistically independent if and only if 
P(AMB) = P(A) (B) P(A) # 0, P(B) #0 
A and B are also independent if 
P(A/B) = P(A) P(B) #0 
or P(B/A) = P(B) P(A) #0 
In daily life the two events A and B are called independent if they have 
"nothing to do with each other" or they have no concern with each other. For 
example one fan is selected from factory A and another fan is taken from factory B. 
We are interested to find the probability that both are good. To find this probability 
we use the approach "nothing to do with each other". But this is not the statistical 
definition of independence. A and B are independent events if and only if 
P(ANB)=P(A) P(B). 
7.9. MULTIPLICATION LAW FOR INDEPENDENT EVENTS 
Theorem: 
If A and B are independent events having nonzero probabilities, then 
P(AMB) = P(A) (B) ri 
The probability P(A 7 B) is called Joint probability of A and B and P(A) and 
P(B) are called the marginal probabilities of A and B respectively. 
Proof: 
Suppose a sample space S; has n total points. The event A belongs to S; and 
has m favourable points. Similarly the sample space Sz has N total points. The event 
B belongs to Sz and has M favourable points, 


Therefore, P(A) = si Aen GS p hor M 

Since A and B are independent events, the total points for the combined event 
A and B will be nN and total favourable points for joint event A A B will be mM. 
Hence 


mM m 
n 


P(ANB) = Fin ea a = P(A) P(B) 
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This is called multiplication theorem of probability for independent events. It 
can be generalized for more than two independent events. For k independent events, 
P(A, MA, AZO... OA) = PC(A,) P(A2) P( Aj) «-- PC Ay) 
Corollary: If A and B are independent, then the following equations are also true. 


@) P(ANB)= P(A) P(B) 
i) P(ANB)= P(A)P(B) 


(ii) P(ANB)= P(A) P(B) 
Example 7,37. 

Two coins are rolled. Show that the event "head on the first coin" and the event 
"both faces are same" are independent. 

Solution: 
When two coins are tossed, the sample space is 
S = {HH, HT, TH, TT} 

The first letter refers to the first coin and the second letter refers to the second 
coin. Let A denote the event "head on the first coin" and B denote the event "both 
faces are same". It is assumed that the four outcomes in S are equally likely. Thus 
probability of 1/4 is assigned to each pair. We have 


A { HH, HT}, P(A) 


IDO [bo 
tole ple 


B { HH, TT}, P(B) 


ANB =. {HH}, P(ANB)= 
P(ANB)= P(A) P(B) = 1/4 

‘Thus A and B are independent. 
Note: The students are advised to verify the following equations. 


@) P(ANB)= P(A)P(B) Gi) PAM B)= P(A) P(B) 

Gi) P(CANB)= P(A) P(B) 
Example 7.38. 

A six faced die is rolled. Show that the event "face is even" and the event "face 
is more than 4" are independent. 
Solution: 

When a die is rolled, the sample space is S = { 1, 2, 3, 4,5,6} 

Let A denote the event "face is even" and B denote the event "face is more than 
4" we have 


A 


CS lel 


{2, 4,6}, P(A) 


ul 
“th C3100 
wie Dole 


B {5,6}, PCB) 
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AnB= {6}, PANB)= 2 
1 
2 


pe 
P(AMB) = P(A)P(B) = 5% 3 = 4 
Hence A and B are independent events. 
Example 7,39. 4 


The probability is 2/3 that Mr. A will pass the examination and the probability 
is 3/4 that Mr. B will pass the examination. Find the following probabilities 


(i) both will pass (ii) only one will pass 
(iii) somebody will pass (at least one will pass). 
Solution: 


Let the symbols A and B denote "passing the examination" and A and B denote 
"failing the examination". 


We are given P(A) = 5 and P(B) 
2 salam tah 
se ery cr 
We shall assume in this case that Mr. A may or may not pass, but the result of 
Mr. B will not be influenced by the result of Mr. A. Thus A and B ore independent. 


Similarly A and B, Aand B, Aand B are also independent. 


Wehave P(A) = 1~-§ = 2 and P(B) 


@) P( both willpass) = P(AMB)= P(A).P(B) = A ae = - 

(ii) Only one will pass. This event includes "A pass and B fail" and "A fail and B 
pass". 
Thus P( only one will pass) = P(ANB)+P(ANB) 


P(A). P(B)+P(A).P(B) 
2,4,1,3 lp nig as 
a4 8) 2 By 64 12 
(iii) P( somebody will pass) = P( atleast one will pass ) 


= P(ANB)+P(ANB)+P(AMB) 
P(A) PB) + P(A) PCB) EE CADECE,) 

(5)(4)+(3)(2)-(@)(4) - 
Another Approach 


The complement of "somebod) will pass" is that "both will fail". We know that 
P( somebody will pass ) + P( no body will pass) = 1 


i 
' 
“4 
x 


u 


eg 
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Hence P( somebody will pass ) = 1 — P( no body will pass) = 1—P(ANB ) 
> = 1 1 11 
= 1=—P(A) PCB) = .1—- 3.*4 =. 
This result is the same as obtained above. 
Example 7.40. 


Two cards are drawn from a well-shuffled ordinary pack of 52 cards. The first 
card is replaced before selecting the second card. Find the following probabilities. 


@) both are kings (ii) first is king and second is queen 
(iii) one is king and one is queen (iv) both are faced cards 
(v) first is card of diamonds and the second is an ace. 

Solution: 


We shall assume here that all the 52 cards are equally likely and each card has 
the probability of 1/52. As the first card is returned to the main pack before selecting 
the second card, therefore the selection of the second card is independent of the first 
card, Thus the two cards are selected independently. 

(i) Both are kings 
There are four kings in a playing card, therefore 


P(first card is a king) = a = P(Ki) 
P(second card is aking) = = = P( Ka) 


K, and Ky are independent, 


Thus P( both are kings) = P(K, A K,) = P(K,) P(K,) = ae x = = = 


(ii) First is king and second is queen 


P(first isking) = 2 = P(K,) 


P( second is queen ) = 4 = P(Q,) 


ae i 1 
PCK, 9 Q,) SAMO ae eo era Sop 


(iii) One is king and one is queen 
This event is different from the event “first is king and the second is queen". 


Wehave P(firstisking) = A= P(K,) 


" 


P( second is queen ) = = P(Q,) 


B(K, A Q,) 


4.4 
P(K,) P(Q,) = 52%52 = Teo 
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Similarly 
P( first is queen) = 4s = PQ) 
ea 4 
P( second is king ) = bo = PK) 
4 4 1 
and P(Q, 9 K,) = P(Q,) P(K,) = b2 “52 7 i69 
"One is king and one is queen" includes the two events (K, 9 Q,) and (Q, 7 K,). 
as 1 2 


Thus P( one king, one queen ) = P(K, 9 Q,) + P(Q, 9 K,) = Teo * ieo= ico 


(iv) Both are faced cards 
There are 12 faced cards in a playing card. Therefore, 


P( first card is faced) = 52> 19= PCF) 
P( Ascorid card is faced ) = ae = . = PF) 
P( both are faced cards)= P(F,AF,) = P( F,) P(F,) = ae = - nal 


(v) First is card of diamonds and the second is an ace 
There are 13 cards of diamonds and 4 are aces 


P( first card is diamonds ) = re z = P(D,) 
P( second card is an ace) = $- 3: P(A.) 


P(D, A) = P(D,) P(A,) = 4 * 7397 59 

Example 7.41 
Three groups of children contain respectively 3 girls‘and 1 boy; 2 girls and 2 
boys; 1 girl and 3 boys. One child is selected at random from each group. Find 
the chance that the three selected consist of 1 girl and 2 boys. 

Solution: 


Let Gi = agirlis selected from group 1 P(Gi) = 3/4 
Gi = a girl is selected srom group 2 P(G.) = 2/4 
Gs = a girl is selected from group 3 (Gs) 1/4 
Bi = a boy is selected from group 1 ‘Bi) V4 
Bz = a boy is selected from group ? Be) 2/4 
Bs = a boy is selented from group o + Bs) 3/4 
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We have to find P(Gi 4 Bz 4 Bs) U P(Bi 4 G2 Bs) U P(Bi 4 B29 Gs) 
; = P(G:1) P(Bz) P(Bs) + P(Bi) P(Gz) P(Bs) + P( Bi) P( Bz) P(Gs) 
3)(2)\(3 1\(2)\(3 1\(2)\(1 
(QO@Q- OO +QO@ 
ee te eee 
64 
Example 7.42 
For two independent events A and B, P(A) = a and P(AUB) = B such that 


B >a. Then show that (B) = &—* 


1-a* 
Solution: 
Since A and B are independent events, therefore 
P(AUB)= P(A) +P(B)- P(A) P(B) 
B = a+P(B)-aP(B) or B = a+P(B)(1-a) 
P(B)(1-a) = B-a or P(B) = er : 
7.10, MULTIPLICATION LAW FOR DEPENDENT EVENTS 
Theorem: 
If A and B are two dependent events, then 
P(ANB)= P(A)P(B/A) = P(B)P(A/B) 


"i 


Proof: 
Suppose there is a sample space S containing n points (outcomes) which are 
equally likely. Let event A has m, favourable points, event B has m, favourable 


points and A B has m, favourable points. 


Thus, P(A) = “*>m>0 P(B) = 2?+m,>0 P(AMB)= “ 
m3_ ms .m_ m , ms 
P(ANB)= A toda en, en aay 
_ m . ms/n P(ANB) 
=n mtn = POA pea) 
= P(A)P(B/A) « [Since P(B/A) = ACB) 
Similarly, 
= ™_ ms, m_ m , my 
P(ANB)= n n aa on * ma 
_ m2. ms/n _ P(ANB) 
[am inphiie: PB) P(B) 


= P(B)P(AIB) | Since P(A/B) = MAO) 


P(B) 
Hence, PQPANB) = P(A) P(B/A) = PLB) P(A/B) 
This is called the genera! ruts of multi + = for probabilities. 
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Corollary: This rule can be extended to more than two events. For three events 
named A, B and C, we have ; 


P(ANBOC)= P(A) P(B/A) P(C/ANB) 

If A, B and C are independent, then P(AN BNC) = P(A) P(B) PCC) 
Example 7.43. ; 

A bag contains two defective D,, D, and three good G,, G,, G, bulbs. Two bulbs 
are taken, Find the following probabilities. 

(i) First bulb is defective and second is good 

(ii), One is defective and one is good. 

(iii) Both are good or both defective. 


Solution: 
() Let A represent the event "first is defective" and B represent the event "second 
is good", 


Clearly, P( first defective)=P(A) = 2/5 

We assume as if only one bulb has been selected. Now, we assume that event A 
has occurred. This means that one defective bulb has been taken out from the 
bag and the bag now has one defective and 3 good bulbs. Another bulb is 
selected from this bag. 

Thus P( second is good/first was defective) = P(B/ A) = 3/4 

We want the probability that first is defective and second is good. We apply the 
multiplication rule 


P(ANB)= P(A).P(B/A) = 2 noe d 
(ii) We write the probabilities as if the bulbs have been selected one by one. 
P( first defective ) = P(A) = 26 
P( second good/first was defective) = P(B/A) = 3/4 
at Sa an rae 
P(ANB) = P(A) P(B/A) Big ef OF Io See) 
Now,  P( first is good ) = 3/5= P(C) 
P( second defective/first was good) = 2/4= P(D/C) 
tS ee 
P(CAD) =:P(C).P(D/C) = 5% 4 =i9 = P(E) 


One defective and one good includes "first defective and second good" and "first 
good and second defective". From addition law, we get 


P(1 defective, 1 good) = P(ANB)+P(COD) = P(E,)+ P( By) 
B, and E, are mutually exclusive events 


3 3 6 3 
P(E, orE,)= P(E,UE,) = P(E,) + P(E.) = 79 +99) = 40> 5 
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@ii) P( first is good)= 3/5= P( A,) 


P( second good/first was good ) = P(A/A,)= 2/4 
3 2 3 
PCAYOA,) = P(A))-P(A/A) = § * 4 = qo = P(E) 


P( first defective ) = z = P(A) 


P( second defective/first was defective) = 1/4 


n 

~ 
2 
= 
<9 


Dia 32 
P(ASOA,) = PCAs) P(A As) = = XZ =G9= PCE,) 


"Both good or both defective" includes E, and E,. E, and E, are mutually 


exclusive events in the sample space of two bulbs. We apply addition law of mutually 
exclusive events. 


Thus P( both good or both defective ) 
= P(E,UE,) = P(E,)+P(E,) eS et ot 


10 10 10 
Example 7.44. 


In a certain teaching department, there are 4 lecturers and 4 assistant 
professors. Any three teachers are called upon to give a lecture turn by turn. Find 
the probability that a lecturer will come first and then the two assistant professors 
will come. 


Solution: 


Let A represent the event that first to deliver lecture is a lecturer, B represent 
the event that the second teacher is assistant professor and let C represent the event 
that the third teacher is also assistant professor. We have to calculate the 
probability of the joint event A B 4 C by the multiplication rule 

P(ANBOC) = P(A). P(B/A)P(C/ANB) 
Now P(A) = 4/8, P(B/A)= 4/7, P(C/ANB)= 3/6 


Hence P(ANBNOC) = 


Example 7.45 P, 


What is the probability of getting two consecutive aces in two cards drawn at 
random from an ordinary deck of 52 playing cards if 


(i) the first card is replaced before the second card is drawn 

(ii) the first card is not replaced before the second card is drawn 
Solution: 
@ = n(S) = 52C, = 52 

Let A be the event that first card drawn is an ace 


ie ee Y 
6 ~ 07 


A 4 1 
n(A) = 4Ci = 4 MA = Be 2-5 
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Let B be the event that second card drawn is also an ace 
Ps Ls 02) ie ae 
n(B) = 4C,; = 4 PR(B) = n(S) ~ 52713 
Since A and B are independent events, therefore 
sin 1 1 
P(AMB) = P(A) P(B) = ja 19) > 60 
Gi) n(S) = 8¢, = 52 
Let C be the event that first card drawn is an ace 
mae SRO) = dete 
n(C) = 4Q, = 4 P(C) = n(S) = 527 13 
Because the first card drawn is not replaced, therefore 
n(S) = 51Q; = 61 
Let D be the event that second card drawn is also an ace given that first card 


was an ace 

n(D/C) = 8C:1=3  P(D/C) = te mane + 
Since C and D are dependent events, therefore 

P(CAD) = P(C)P(D/C) = x % = oat 


7.11. SELECTIONS WITHOUT AND WITH REPLACEMENT 

The students are advised to clearly understand the difference between two 
types of selections of some elements from a certain lot. The lot may contain any 
material like bulbs, students, balls, cards etc. There are two methods of selecting a 
sample from a certain lot, One is called without replacement and the other is called 
with replacement. When only one element is to be selected from the lot, there is no 
difference in these two terms, When the second element is selected, the question is 
whether the first element has been returned to the main lot or not. If the first 
element is not returned to the main lot before selecting the second element, then the 
selection is called without replacement. Similarly the second element is not returned 
before taking the third element and so on, There is another special case in which the 
first element is returned to the main lot before selecting the next individual. This is 
called selection with replacement. 

7.11.1. WITHOUT REPLACEMENT 

There are three methods of calculating the probabilities when a sample is 
selected from a certain lot without replacement. We explain these methods with the 
help of a simple example, 

Example 7.46, 

A container has 3 good ( G,, G, and G,) and 2 defective (D, and D,) items. Two 
items are selected at random and without replacement. Find the following 
probabilities, 

(i) both are good, (ii) one is good and one is defective, 
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Solution: 
MethodI: By Sample Space 
This method consists of making the sample space and then writing the 
probabilities of the required sub-sets. The sample space S consists of all possible 
combinations. It should be remembered that we shall be required to make the 
permutations if the event is "first will be good and second wiil be defective". Here we 
have 5 objects and 2 objects are selected at a time. Thus the number of possible 
combinations (outcomes) are 
N, 5 Dees 
C= "C= a6-g1- 10 
The sample space of 10 outcomes is 
S = {G,G,G,G, G,G, GD, G,D,, GD, G,D,, G,D,, G,D,, D,D,} 
(i) Let A denote the event "both are good". Then 
A = {G,G, G,G,, G,G,} 
P(A) = Number of points in A Seaye 
(A) = Total number of outcomesinS ~ 10 
(ii) Let B denote the event "one good and one defective". Then 


3 B = {G,D,, G,D, G,D,, GD, GD, G,Dz} 
___ Number of points in B 6 


P(B) =. Fotal number of outcomes inS ~ 10 

It should be remembered that the event "one good and one defective" is also 
called "exactly one good" or "only one good". 
Note: In this question we have made the sample space of combinations. We shall 
get the same result if we write all possible permutations. This is left for an exercise 
for the students. ‘ 
Method II: Using Laws of Probability 

In this.method, we shall assume that the items are selected one by one. Let A 
be the event that the item on the first draw is good and B be the event that the item 
on the second draw is also good. 
@) P(A) = 3/5, | P(B/A) = 2/4 

Using multiplication law of probability, we get 


P(both god) = P(ANB)= P(A). P(B/A) 
Which is the same as obtained in Method I. 


(ii) P( first item is good ) = P(G)= i 
P( second item is defective /first item is good ) = 2/4 
P( first good and second defective ) = PP 
3 
5 


io = PE) 
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Similarly, 
P( first defective and second good) = P(DNG) = P(D).P(G/D) 
2.3 3 
= 5%4 = 97 PCB) 


We need the probabilityof the event "one good, one defective". This event 
includes E, and E,. 

Therefore, P(E,orE,) = P(E\UE,)= P(E) +(E,)= + = & 

The answer is the same as obtained in Method I. 
Method III: By Hypergeometric Distribution 

The first method is suitable when the size of the sample space is small and the 
second method is suitable when the sample size consists of a few elements. Method 
III can be used when the sample space is very large. We get the required 
probabilities as 


: 3Cq . 2 
(i) P( both good ) = GG . < 
(i) P( one good, one defective) = “yO, = & 


Students are advised to repeat this question when there are 30 good and 20 
defective items and two items are selected without replacement 
7.11.2, WITH REPLACEMENT 

When a sample is selected with replacement, we can calculate the probabilities 
by three different methods discussed in the example below. 
Example 7.47. a 

A container has 3 good ( G,, G, and G,) items and 2 defective items @, and D,). 
Two items are selected with replacement. Find the following probabilities. 

G) both are good, 

(ii) one is good and one is defective. 

( Example 7.39 has been repeated here for the con. \ience of students ) 

Solution: 
Method I: By Sample Space 

When 2 items are selected out of 5 items with replacement, the sample space S 
consists of 25 pairs 5” = 25, The sample space is 
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(i) Let A represent the event "both good" then 
{G,G,, G,G,, G,G,, ..., G,G,}. The event A contains 9 outcomes 


Number of outcomes in A 9 


Therefore, P(A) = ‘Total number of outcomesinS ~ 25 


(ii) Let B represent the event "one good, one defective", then B has 12 outcomes. 
Therefore, P(B) = 12/25 

Method II: Using Laws of Probability 
Here we shall consider that the items are drawn one by one. 

(i) Thus P( first item is good) = P(A) = 3/5 
and P( second item is good) = P(B) = 3/5 


As the Ist item is returned to the main lot, therefore the second item is selected 
from the container having the same original contents. We need the probability 
that both are defective. It is obtained by using multiplication law of independent 
events. 


Thus P(both good) = P(ANB)= P(A).P(B)= 3/5 x 3/5=9/25 
(a) P(first is good) = P(G) = 34 

P(second is defective) = P(D) = 

P(GND) P(G).P(D) 
3/5 x 2/5 = 6/25 = P(E,) 


i} 


Similarly, we have 
P(DAG) = P(D).P(G) 
=" 2/5 x 3/5 = 6/25.= P(E,) 
The event "one good, one defective” includes E, and E,. As E, and E, are 
mutually exclusive events, therefore 
P(E, orE,)= P(E,UVE,) = P(E,)+P(E,) = 6/25 + 6/25 = 12/25 
Method III: By Binomial Distribution 


This method is applicable in special cases when the main lot can be divided 

into two seperate groups. For example the good and defective items in a certain lot, 

+ intelligent and non-intelligent students in a class, smokers and non-smokers in a 

city and so on. This method shall be discussed in detail in chapter binomial 
probability distribution. 
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SHORT DEFINITIONS 
aa SPT Se TT SSSR ero oe a i TE 


Factorial As 
Given the positive integer n, the product of all the whole numbers from n down 
through 1 is called n factorial and is written as n! 

or 
Factorial is defined to be the product of all natural numbers less than or equal to a 
particular number. 
Permuiation 
A permutation is an arrangement in which the order of the objects selected from a 
specific pool of objects is important. 

or 
A permutation is an ordered arrangement of objects, 
Combination 
A combination is collection of a group of objects without regard to order, 

or 
A combination is an arrangement of objects without regard to order, 
Experiment 
An experiment is the observation of some activity or the act of taking some 
measurement. 

or 
An experiment ig-any process that allows researchers to obtain observations. 
Random Experiment 
A random experiment is an experiment where there is uncertainty about the 
occurrence ofan outcome. 

or 


A random experiment is any well-defined procedure that produces an observable 
outcome that could not be perfectly predicted in advance, 


Sample Point 
An individual outcome of an experiment is called a sample point. 
Outcome 
An outcome is a particular result of an experiment. 
or 
Each distinct possible result ofian experiment is called an outcome. 
Sample Space 
The collection of all possible outcomes for a random experiment is called the sample 
space. 
or 
The sample space for a random experiment is the set of all possible outcomes from 
that experiment. 
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Event 
An event is the collection of one or more outcomes of an experiment. 
or 
Any subset of a sample space is called an event. 
Simple Event 
A single outcome of an experiment is called a simple event. 
or 
A simple event is an outcome that cannot be broken down any more. 
Compound Event 
An event combining two or more simple events is called a compound event. 
or 
If an event contains more than one sampie points, it is called a compound event. 
Venn Diagram 
Diagram used to portray a sample space or a relationship between events is called 
venn diagram. 
or 
A venn diagram is a graphical device for representing symbolically the sample space 
and operations involving events. 
Tree Diagram 
A method for illustrating the sequence of several events is called a tree diagram. 
or 
A tree diagram is a graphical device helpful in defining sample points of an 
experiment involving multiple steps. 
Equally Likely Events 
Events that have the same probability of occurrence are called equally likely events. 
or 
Two events are equally likely when there is no reason to expect that one event has 
more chance to occur than the other event. 
Mutually Exclusive Events 
Two events are said to be mutually exclusive if they have no outcomes in common. 
or 
Two or more events that cannot occur together on a single trial of an experiment are 
mutually exclusive events. 
Exhaustive Events 
A set of events is said to be collectively exhaustive if their union is equal to sample 
space S. 
Completely Exhaustive Event 
An event is said to be completely exhaustive if it is alone equal to the sample space S. 
Probability 
Probability is the likelihood or chance that a particular event will occur. 
or 
Probability is a number between zero and one that is used to measure uncertainty. 
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Basic Properties of Probability 

(i) The probability of an event is always between 0 and 1. 

(ii) An event which cannot occur has the probability zero. 

(iii) The probability of any event that is certain to occur is one. 

(iv) The sum of all the associated probabilities must be one. 

Classical or Mathematical Definition of Probability 

Classical probability is applied when the possible outcomes of an experiment are 
equally likely to occur. If an experiment can produce N mutually exclusive and 
equally likely outcomes out of which n outcomes are favourable to the occurrence of 
event A, then the probability of event A is 


P(A) = Number of outcomes favourabletoA — nh 
(A) = Total number of possible outcomes ~ N 


Relative Frequency or Statistical or Empirical Definition of Probability 
This definition is based on the actual observations when the random experiment is 
repeated a large number of times under similar conditions. If an experiment is 
repeated N times where N is large and the event A is observed n times, the 
probability of A is 


P(A) = Limit 
Noo 

Subjective Definition of Probability 
According to this definition, the probability of an event is the degree of confidence or 
belief on the part of the statistician or decision maker that the event will occur. 
Conditional Probability 
A conditional probability is the probability that an event will occur given that 
another event has already occurred. or 
The probability associated with the reduced sample space is called conditional 
probability. 
Independent Events 
Two events are said to be independent if the occurrence or nonoccurrence of one 
event does not affect the probability that the other will occur. or 
Two events A and B are independent if the probability of the second event B is not 
affected by the occurrence or nonoccurrence of the first event A. 
Dependent Events 
Two events are dependent if the occurrence of one event impacts the probability of 
the occurrence of the other event. or 
Two events are said to be dependent events if occurrence of one event affects the 
occurrence of other event. 
Statistical Independence 
If events A and B are stati8tically independent, the probability of the occurrence of 
one event is not affected by the occurrence of the other. That is, P(A/ B) = P(A) and 
P(B/ A) = P(B). 


310 Basic Statistics Part-I 


Statement of Addition Rule for Mutually Exclusive Events 
If A and B are two events, the probability that either event A or event B occurs 
equals the probability that event A occurs plus the probability that event B occurs. 
In symbols, P(A or B) = P(A) + P(B). 
Statement of General Addition Rule 
Given two events A and B, the probability that event A or event B or both occur is 
equal to the probability that event A occurs, plus the probability that event B occurs, 
minus the probability that both events occur. In symbols, 

P(A or B or both) = P(A) + P(B) — (ANB). 
Statement of Multiplication Rule for Independent Events 
If A and B are two events, the probability that both event A and event B occur 
equals the probability that event A occurs into the probability that event B occurs. 
In symbols, P(A and B) = P(A) P(B). 
Statement of General Multiplication Rule 
The probability of the joint occurrence of event A and event B is equal to the 
probability of A into the conditional probability of B given A or probability of B into 
the conditional probability of A given B. In symbols, 

P(ANB) = P(A) P(B/A) = PB) P(A/B). 


MULTIPLE - CHOICE QUESTIONS 


1. When the possible outcomes of an experiment are equally-likely to occur, then 


we apply: 
(a) relative probability (b) subjective probability 
(c) conditional probability (d) classical probability 
2. Anumber between 0 and 1 that is used to measure uncertainty is called: 
(a) random variable (b) trial 
(c) simple event (d) probability 
3. Probability lies between: 5 
(a) -land+1 (b) Oand1 
(c) Oandn (d) Oandoo 
4. Probability can be expressed as: 
(a) ratio (b) fraction 
(c) percentage (d) all of the above 
5. The probability based on the concept of relative frequency is called: 
(a) empirical probability (b) statistical probability 
(c) both (a) and (b) (d) neither (a) nor (b) 
6. The probability of an event cannot be: 
(a) equal to zero (b) greater than zero 


(c) equal to one (d) less than zero 
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7. A measure of the chance that an uncertain event will occur: 
(a) an experiment (b) an event 
(c) a probability (d) a trial 

8. A graphical device used to list all possibilities of a sequence of outcomes in a 
systematic way is called: 


(a) probability histogram (b) venn diagram 
(c) pie diagram (d) tree diagram 
9. Arandom experiment contains: 
(a) at least one outcome (b) at least two outcomes 
(c) at most one outcome (d) at most two outcomes 
10. The probability of all possible outcomes of a random experiment is always 
equal to: 
(a) one ik (b) zero 
(c) infinity (d) all of the above 
11. The outcome of tossing a coin is a: 
(a) mutually exclusive event (b) compound event 
(c) certain event (d) simple event 
12. The result of no interest of an experiment is called: 
(a) constant (b) event 
(c) failure (d) success 
13. A set of all possible outcomes of an experiment is called: 
(a) combination (b) sample point 
(c) sample space (d) compound event 


14. The number of counting rules that are useful in determining the number of 
outcomes in an experiment are: 


(a) one (b) two 
(c) three (ad) four 
15. The events having no experimental outcomes in common is called: 
(a) equally likely events (b) exhaustive events 
(c) mutually exclusive events (d) Independent events 
16. A set of outcomes formed after some additional information is called: 
(a) sample space (b) reduced sample space 
(c) null set (ad) random experiment 
17. The probability associated with the reduced sample space is called: 
(a) conditional probability (b) statistical probability 
(c) | mathematical probability (d) subjective probability 
18. An arrangement of objects without regard to order is called: 
(a) permutation (b) combination 


(c) random experiment (d) sample point 
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21. 
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23. 


24. 


25. 


26. 


27. 


28. 


29, 
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The number of permutations of a sei of n things, taken r at a time with n > r is 


given by: 
n! n! 
(a) (n-r)! (b) (r-n)! 
! j ! 
(©) r! ae r)! (d) a 


If three candidates are selected to attend a course from the ten candidates, the 

number of ways of selecting the candidates is an example off 

(a) combination (b) permutation 

(c) reduced sample space (d) both (a) and (b) 

When each outcome of a sample space is as likely to occur as any other, the 
outcomes are called: 


(a) exhaustive (b) mutually exclusive 

(c) equally likely (d) not mutually exclusive 
If A is any event in S and A its complement, then P(A) is equal to: 
(a) 1 (b) 0 

(ec) 1-A (d) 1-P(A) 

When certainty is involved in a situation, its probability is equal to: 
(a). zero (b) between-1land+1 

(c) between 0 and 1 (d) one 

Which of the following cannot be taken as probability of an event? 
(a) 0 (b) 0.5 

(c) 1 (qd) -1 

If an event contains more than one sample points, it is called a: 
(a) simple event (b) compound event 

(c) impossible event * (d) certain event 


When the occurrence of one event has no effect on the probability of the 
occurrence of another event, the events are called: 


(a) independent (b) dependent 

(c) mutually exclusive (d) equally likely 

A particular result of an experiment is called: 

(a) trial (b) simple event 

(c) compound event (d) outcome 

A collection of one or more outcomes of an experiment is called: 
(a) event (b) outcome 

(c) sample point (d) none of the above 


A process that leads to the occurrence of one and only one of several possible 
observations is called: 

(a) random experiment (b) random variable 

(c) experiment (d) probability distribution 
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30. 


81. 


32. 


33. 


34. 


35. 


36. 


37, 


38, 


89. 


40. 


Which statement is false? 

(a) The classical definition applies when there are n equally likely outcomes 
to an experiment. 

(b) The empirical definition occurs when number of times an event happens 
is divided by the number of observations. 

(c) Asubjective probability is based on whatever information is available 

(d) The general rule of addition is used when the events are mutually 


exclusive 
The term 'sample space’ is used for: 
(a) all possible outcomes (b) all possible coins 
(c) probability (d) sample. 
The term 'event! is used for: 
(a) time (b) a sub-set of the sample space 
(c) probability (d) total number of outcomes. 
The six faces of the die are called equally likely if the die is: 
(a) small (b) fair 
(c) six-faced (ad) round. 
If we toss a coin and P(H) = 2P(T), then probability of head is equal to: 
(a) 0 (b) 1/2 
() 1/3 (@) 2/3 


A letter is chosen at random from the word “Statistics”. The probability of 
getting a vowel is: 

(a) 1/10 (b) 2/10 

(c) 3/10 (d) 4/10 

An arrangement in which the order of the objects selected from a specific pool 
of objects is important called: i 

(a) combination (b) permutation 

(c) factorial (d) sample space 

Two books are to be selected at random without replacement out of four books 
The number of possible selections are: 

(a) 4 (b) 2 ’ 

() 6 (d) 3 

Three books of different colours are to be arranged in a book-shelf. he possible 
arrangements are: 


(a) 3 (b) 1 

() 6 (a) 2 ; 

If a sample S = { 1, 2}, the number of all possible sub-sets are: 

(a) 2 (b) 1 

(©) 3 (d) 4 

When a die and a coin are rolled together, all possible outcomes are: 
(a) 6 (b) 2 


(©) 36 @) 12 
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41. When two coins are tossed, the possible outcomes are: 


(a) 2 (b), 4 
(ec) 1 (d) none of them. 
42, If three coins are tossed, the possible outcomes’are: 
(a) 8 (b) 3" 
(c) 1 (ad) noné% on them. 
43. Ifn coins are tossed, the possible outcomes aré: 
(a) n (b) 2 
(c) 20 (d) all of them. 
44, If two dice are rolled, the possible outcomes are: 
(a) 6 (b) 36 
() 1 (d) difficult to answer. 
45. When n dice are rolled, the possible outcomes are: 
(a) 6 (b) 6 
i) (d) 18. 


46. When one card is selected at random from a pack of 52 playing cards, the 
possible selections are: i F , 

(a) 104 (b) 52 
(©) 520 (a) 2704 

47. ‘Two cards are selected at random with replecement froma Packt of 52 playing 
cards. The possible outcomes are: 

(a) 52x 52 (b) 52 
(c) 1826 qd 2 

48. A bag contains 4 white and 2 black balls of the same : size and weight, and two 
balls are selected at random without replacement, thé possible selections are: 
fa) 6 , . (b) 4 
(©) 36 @ 15 

49. Two balls are selected at random with replacement, fre ~ a bag containing 3 
red, 3 black and 2 green balls. The possible outcomes are: 

(a) 8 (b) 64 
() 16 (d) 2 ii 

50. Five cards are selected at random from a pack of 52 cards with replacement. 
The possible combinations are: 

2 (a) 52 (b) (52)6 
(c) 52x 52 (d). (5) 

G1, The digits 1, 2, 3, 4, 5 are the roll numbers of 5 students. These roll numbers 
are written on the paper slips and two paper slips are selected at random 
without replacement. The possible combinations-are; 

(a) 5 ) 2 
(ec). 25 dad) 10 
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52. 


53. 


54, 


55. 


56. 


57, 


58. 


59. 


60. 


61. 


62. 


Which is the impossible event when a die is rolled? 

(a) 2o0r8 (b) 5or6 

(cy (d) Oor7 

The probability of drawing any one spade card is: 

(a) 1/13 (b) 1/4 

(ce) 4/13 (a) 1/52 

A balance die is rolled, the probability of getting an odd number is: 
(a) 1/2 (b) 1/4 

() 1/6 (d) 1/3 

Two fair dice are rolled, The probability of throwing an odd sum is: 
(a) 1 (b) 1/2 

(c) 1/6 (d) 1/386 


Given P(A) = 0.4, P(B) = 0.5 and P(AUB) = 0.9, then: 

(a) Aand B are not mutually exclusive events 

(b) Aand B are equally likely events 

(c) Aand B are independent events 

(d) Aand B are mutually exclusive events 

If P(B/A) = 0.50 and P(AMB) = 0.40, then P(A) will be equal to: 
(a) 6.40 (b) 0.50 

(c) 0.80 Ka) od 

Which of the following statements is incorrect: 


(a) A-(BUC) =(A-B)n(A-C) (6+) (AUB) =ANB 


315 


() (ANB) =AUB (a) A-(BNC) =(A+B) U(A-C) 
If P(A/B) = P(A) and P(B/A) = P(B), then A and B are: 

(a) mutually exclusive (b) dependent 

(c) equally likely (d) independent 

A fair coin is tossed 100 times, the expected number of heads is: 

(a) 100 (b) 50 

(c) 30 (d) 60 

When two dice are rolled, the maximum total on the two faces of the dice will 
be: 

(a) 6 (b) 36 

(c) 12 (d) 2 


A random sample of 200 random digits is selected from a random number 


table, Expected number of zeros in the sample is: 
(a) zero (b) 10 
(ec) 20 (d) 5 
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63. Six digits are selected at random again and again from a random number table 
and the even digits are counted each time. In most of the cases, the number of 
even digits will be: 


(a) 2 (b) 3 a 
() 4 (dd) 6 
64. Two events A and B are called mutually exclusive if: 
(a) AUB=o (b) ANB=@ 
(© ANBeES (dd) ANB=1 
65. If A and B are two mutually exclusive events, then: 
(a) P(ANB)=0 (b) P(ANB)=1 
(c) PAUB)=0 (dq) P(ANB)=S. 
66. When A and B are two non-empty and mutually exclusive events, then: 
(a) P(AUB) = P(A) . P(B) (b) P(AUB) = P(A) + P(B) 
(c) P(AMB) = P(A) . P(B) (d) P(AMB) = P(A) + P(B) 
67. The two events A and B are called not mutually exclusive events if: 
(a) ANB=o (b+) AnB¥o 
() AUB=o (d) AMB=zero 
68. IfA and B are disjoint events then the statement which is always true is: 
(a) P(A/B) =0 (bt) P(AUB)=0 
(c) P(ANB)=1 (d) P(A) =P(B) 
69. The events A, B and C are called exhaustive events if: 
(a) AUBUC=S (b) ANBoC=S 
() AUBUC=o (dd) AUBUC=zero. 


70. IfA and B are not-mutually exclusive events, then: 
(a) P(Au B) + P(AMB)= P(A) + P(B) (b+) PAU B) = P(A) + P(B) 


(©) P(AUB) = P(A) P(B) (d) PAB) = P(A) + PB) 
71. Ifanevent Ais the complement of the event A, then: 

(a) AUA=S () AnA=S 

@ AUVA=o9 (a) P(A)= P(A) 


72. IfAj, Ag, Ag, «++, Ay are k mutually exclusive events, then: 
(a) P(A, UA, U Ag U ++ U Ay) = P(A,) + P(A,) + P(A) + = + P(A,) 
(b) P(A, UAZU = UAQ) > 1 
(c) P(A,M ARAN AY) = 1 
(2) PCAN ALA Ay) = PCAYU Ag UU Ay). 


73. IfA is an empty set and B is 2 non-empty set then: 
(a) ANB=S (b) ANB=B 
() -AUB=B (da) P(A)=P(B) 
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74. IfAis an empty set (9) and S is the sample space then: 


(a) P(AUS) = P(S) (b) P(AUS) =P) 
(c) P(ANS)=1 (d) P(AUS) = zero 
75. IfA and B are independent events, then: 
(a)’ PAU B) = P(A) P(B) (b) P(AMB) = P(A) P(B) 
(c) P(ANB) = P(A) + P(B) (d) P(A) = P(B) 
76. IfA and B are two independent events, then: 
(a) P(A/B) = P(A) (b) P(A) = P(B) 
(c) P(A) < P(B) (a) P(A/B) = P(B/A). 
77. AandB are two independent events. Which one of these equations is false? 
(a) P(AMB)= P(A) P(B) (b) P(AN B) = P(BO A) 
(c) P(AW B) = P(A) P(B) (d) P(A UB) = P(A) P(B) 
78. The conditional probability of the event A when event B has occurred is 
denoted by: 
(a) P(A+B) (b) P(A-B) 
() P(A/B) (@) P(A) 
79, IfA and B are any two events, then P(A/B) + P(A/B) is equal to: 
(a) 0 (b) 0.25 
(c) 0.5 (qd) 1 
80. IfAis an arbitrary event, then P(A/A) is equal to: 
(a) zero (b) one 
(c) infinity (d) less than one 
81. IfA and B are any two events, then P(A/B) is equal to: 
(a) P(A/B) (b) 1-P(A/B) 
(ce) 1+P(A/B) @) PAM B) 
82. IfA and B are any two events, then P( AU B ) is equal to: 
(a) 1+P(ANB) (b) 1-P(AUB) 
(ce) 1-P(ANB) (d) P(A) + P(B) 
83. IfA and B are any two events, then P(A 7B) is equal to: 
(a) 1-P(AUB) (b) 1-P(AMB) 
(e) 1-P(ANB) (d) 1-P(ANB) 


84. Which of the following statements is correct: 
(a) AN(BUC)=(ANB)U(CANC) 
(b) AU(CBAC)=(ANB)A(AUY C) 
@© AN(BOC)=(AuUB)UCBOAC) 
(d) AA(BUC)=A+(BOC) 
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85. If A and B are two mutually exclusive and exhaustive events and 
P(A) = 2P(B), then P(B) is equal to: 


(a) 1/2 (b) 2/3 
(©) 1/3 (d) 71/4 
86. Two coins are tossed. Probability of getting head on the first coin is: 
(a) 2/4 (hi 
(c) zero (d) 4 
87. Adie and a coin are tossed together. Probability of getting head on the coin is: 
(a) 6/12 (b) 6 
(c) 12 (d) zero 
88. A fair die is rolled. Probability of getting even face given that face is less than 5 
is given by: 
(a) V2 (b) 5 
(ce) 2 (d) 6 
89. Two coins are tossed. The probability that both faces will be matching is given 
by: 
(a) 1/4 (b) 1/2 
@ 1 (ad) zero. 


90. Two coins are tossed. Probability of getting two heads given that there is at 
least one head is given by: } 


(a) 1/2 (b) 1/3 
(c) 1/4 (d) 2/3 
91. A fair die is rolled. Probability of getting more than 4 or less than 3 is given by: 
(a) 2/8 (b) 3 
(c) 1/2 (d) 4/3 
92. A fair die is rolled. Probability of getting even face or face more than 4 is: 
(a) 1/3 (b) 2/3 
(c) 1/2 (@) 5/6 
93. Two dice are rolled. Probability of getting similar faces is: 
(a) 5/36 (b) 1/6 
() 1/3 (d) 1/2 


94. Two dice are rolled. Probability of getting a total less than 4 or total more than 
10 is given by: 


(a) 10/36 _ (b) 4/36 
() 1/6 (d) 14/36 
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95. Two dice are rolled. Probability of getting a total of 4 given that both faces are 


similar is: 
(a) 5/86 (by) 1/36 
(ce) 4/36 (a) 1/6 


96. If A and B are two not-independent events, then the probability that both A 
and B will happen together is: 
(a) P(AMB) = P(A) P(B/A) (b) P(AMB) = P(A) P(B) 
(c) P(AMB) = P(A) + P(B) (da) P(ANMB) = P(A) 
97. IfA and B are two dependent events, then: 
(a) P(A) P(B/A) = P(B) P(A/B) (b) P(A/B) = P(B/A) 
(c) P(A/B) = P(A) (d) P(A) = P(B), 
98, Which one is true? 


(a) P(ANB) = P(B)-P(AUB) (b) P(ANB) = P(A) -P(ANB) 
@ P(ANB) = P(B)-P(ANB) (d@) P(ANB) = P(A)+P(B). 
99. Given P(ANB) = 3/5, then (AUB )is: 


(a) 1/5 (b) 2/5 
() 3/5 qd) 1 
100. Given P(AMB) = 3/10, then P(AUB) is: 
(a) 7/10 (b) 1/10 
(c) 3/10 (d) 1, 
101. Given P(A) = 2/3, P(B) = 3/8 and P(AMB) = 1/4, then A and B are: 
(a) independent (b) dependent 
(ce) mutually exclusive (d) equally likely 
ANSWERS 


1. {2 @)| 3 @®] 4 @/[s ©/]6 @/7. (©) | 8. a) | 
9._b) | 10. (@) {a1 @) [42.@) [43. © [14. © [ 45. @ | 16.) | 
17, (a) | 18. _(b) | 19. (a) | 20. (a) [ 21.) [22, (@) | 28. @) |) 24. (a) 

5 - (a) | 27. (a) | 28. (a) [ 29.) [ 30. @ | 31. (a) | 82.) 
33. _(b) | 34._(@) | 35. (c) | 36. (b) [ 37. (o) [38. © | 39. (a) | 40. (a) 

; -_(c) | 44. 45. (a) | 46, (b) [ 47. (a) | 48, (d 
49._(b) | 50._(b) [1._(d) | 52. (d@) [53.(b) | 54. (a) [ 55.) | 56._(a) 
57._() | 58. (a) | 59._(@) | 60. (by | 61. @ [62._() | 63.) | 64. (b) | 
| 65. (a) | 66. _(b) [ 67. (b) | 68. (@) [69._ (a) | 70. (a) | 71. (a) | 72. (a) 
78. () {74._ (a) [75._() [ 76. (a) | 77. @) [78._@) | 79. (d) | 80. _(b) 
81._(b) | 82. _()_[83._(a) | 84. (a) | 85. @ [ 86. (a) | 87._(@) | 88._(a) | 

5 ._(b) | 93. (b) | 94, © [ 95. (a) [ 96. (a) 
97. (a) | 98.) [ 99.) [100_(@) [101 (a) 
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SHORT QUESTIONS 


Q.1 Explain the random experiment by giving suitable examples. 

Q.2 What is a random experiment and what are its properties? 

Q.3 Write short note on sample space. 

Q.4 What is meant by an event? 

Q.5 Distinguish between simple and compound events. 

Q.6 Define Venn diagram. 

Q.7 Explain the concept of equally likely events and give one example. 

Q.8 Explain the concept of mutually exclusive events. 

Q.9 Distinguish between mutually exclusive and not mutually exclusive events. 

Q.10 Define collectively exhaustive events. 

Q.11 Differentiate between combinations and permutations. 

Q.12 When are two events A and B said to be independent? 

Q.13 Explain the concept of dependent events. 

Q.14 Distinguish between independent and dependent events. 

Q.15 Why do we study probability theory? 

Q.16 Write down a definition of probability. 

Q.17 Differentiate between independent and mutually exclusive events. 

Q.18 Define the complement. 

Q.19 What is the sample space for the roll of a single six-sided dic? 

Ans. S={1, 2,3, 4,5, 6} 

Q.20 Write down the basic properties of probability 

Q.21 Write down the classical definition of probability. 

Q.22 Write down the relative frequency definition of probability. 

Q.23 Write down the three definitions of probability 

Q.24 Define addition law of probability. 

Q.25 Define conditional probability. 

Q.26 Define multiplication law of probability. 

Q.27 State the addition law of probabilities for mutually exclusive events. 

Q.28 State the addition law of probabilities for not mutually exclusive events. 

Q.29 Write the statement for multiplication law of probability for independent 
events. 


Q.30 What is meant by conditional probability? Give examples. 

Q.31 State the multiplication law of probabilities for dependent events. 
Q.32 State the multiplication rule for two independent events, Ei and Ez. 
Q.33 Define the classical method of probability. 

Q.34 Define the relative frequency method of probability. 

Q.35 Define the subjective method of probability. 
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Q.36 


Ans. 
Q.37 
Ans, 
Q.38 
Ans. 
Q.39 
Ans. 


Q.40 
Ans. 


Q.41 


Ans. 
Q.42 


Ans. 
Q.43 
Ans, 
Q.44 


Ans, 
Q.45 
Ans. 


Q.46 


Ans. 
Q.47 
Ans. 
Q.48 


Ans. 


Q.49 
Ans. 


Q.50 


Ans. 
Q.51 


Ans. 


A fair coin is tossed twice. What is the probability of obtaining exactly one 
tail? 

1/2 

Three coins are tossed. Count the total number of sample points in S, 

8 

If three coins are tossed, what is the probability of getting exactly two heads? 
3/8 

If three coins are tossed, what is the probability of getting at least two heads? 
V2 

If three coins are tossed, what is the probability of getting at most two heads? 
7/8 

If three coins are tossed, what is the probability of getting fewer than two 
heads? 

1/2 

A coin is tossed three times and the outcome is recorded for cach toss. Let A 
be the event that the experiment. yields two tails. List the sample points in A. 
{(H, T, T),.(T, H, T), (2, T, A) 

If we roll a balanced die, what is the probability of getting an even number? 
1/2 

Two dice are rolled. Count the total number of sample points in the sample 
space S. 

36 

A pair of dice is thrown. What is the probability that the sum equals 2? 

1/36 


A die is rolled two times. What is the probability that the sum of the numbers 
observed will be greater than 9? 


V6 

A pair of dice is rolled. What is the probability that the sum equals 7? 

V6 

Suppose two balanced dice are rolled. What is the probability that the sum of 
the dice is even? : 

V2 

What is the probability of rolling a eleven with a pair of balanced dice? 

1/18 

When a card is picked from a well-shuffled deck of 52 playing cards, what is 
the probability that it will be a black ace? 

1/26 

If a card is chosen from a well-shuffled deck of 52 playing cards, what is the 
probability of selecting a 10 of any suit? 

1/13 
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Q.52 Acard is selected from a deck of playing cards. What is the probability that 
the card is a king or queen? 

Ans. 2/13 

Q.53 A card is selected from a deck of playing cards. What is the probability that 
the card is a red or black? 

Ans. 1 

Q.54 A card is drawn at random from a deck of 52 playing cards, What is the 
probability that it will be an ace or a face card? 

Ans. 4/13 

Q.55 A card is drawn at random from a deck of 52 playing cards. What is the 
probability that it will be a spade or a face card? 

Ans. 11/26 

Q.56 What is the probability of obtaining two kings in drawing two cards from a 
shuffled deck of cards? If the first card is not replaced before the second card 
is drawn. 

Ans. 1/221 

Q.57 Two cards are drawn at random from an ordinary deck of 52 cards. 
Determine the probability that both cards are aces, if the first card is 
replaced before the second card is drawn. 


Ans. 1/169 
Q.58 Consider drawing two cards from a pack of 52 playing cards: 
let EventA = the first card is a heart and 
Event B = the second card is a red card 
Calculate P(A and B). 


Ans. P(A) = 1/4, P(B/A) = 25/51, P( A and B ) = 0.123 

Q.59 Suppose P(A) = 1/3, P(A U B) = 1/2 and P(A B) = 1/10. Find P(B). 

Ans. 0.27 

Q.60 Suppose P(A) = 1/4, P(B) = 1/3 and P(A U B) = 1/2. Determine P(A 4 B). 

Ans. 1/12 

Q.61 Given P(A) = 1/3 and P(B) = 1/4. Suppose A and B are independent events, 
determine P(A UB). 

Ans. 1/2 

Q.62 If P(A) = 0.2, P(B) = 0.4 and P(A/B) = 0.375, find P(A and B). 

Ans. 0.15 Ez 

Q.63 Given P(A/B) = 1/5 and P(B/A) = 1/2. Find P(A)/P(B) 

Ans. 2/5 

Q.64 Given P(A) = 1/3, P(B) = 1/4 and P(A/B) = 1/6. Find P(B/A). 

Ans. 1/8 

Q.65 IfP(A)=0.7, P(B) = 0.5 and P(B/A) = 0.5, find P(A and B). 

Ans. 0.35 
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Q.66 
Ans. 


Q.67 


Ans. 
Q.68 


Ans, 
Q.69 


Ans, 


Q.70 


Ans. 
Q.71 


Ans. 
Q.72 


Ans. 
Q.73 
Ans, 


Q.74 
Ans. 


Q.75 


Ans. 
Q.76 


Ans. 
Q.77 


Ans. 


IfP( AandB )=0.4 and P(B) = 0.8, find P(A/B). 

0.5 

Given P(A) = 0.7 and P(B) = 0.6. If A and B are statistically independent 
then find P(A and B). 

0.42 

Given P(A) = 0.3, P(B) = 0.4 and P(A and B) = 0,20, Are A and B 
statistically independent? 

No, because P(A and B) # P(A) P(B) 

A coin is tossed four times and the outcome is recorded for each toss. List the 
sample points for the experiment. 

There are 16 equally likely outcomes: 

HHHH HHHT HHTH HHTT HTHH HTHT HTTH HTTT 
TH RET TAT UTR eta TOT TE Per 
Given P(A) = 0.4, P(A UB) = 0.7 and P( B ) = p. Suppose A and B are 
independent events, find the value of p. 

0.5 


Two coins are tossed. Determine the pr ‘ability of getting two heads given 
that there is at least one head. 


V3 

Given P(A) = 2/3, P( A UB ) = 7/12 and P(AMB) = 5/12. Suppose A and B 
are any two events, determine P(B ). 

1/3 

Given P(A) = 1/3, P(B) = 3/4 and P(AUB) = 11/12. Find P(B/A). 

1/2 

A fair die is rolled. Find the probability of getting even face or face more than 4. 
2/30 & 

A fair die is rolled. What is the probability of getting even face given that face 
is less than 5? 

V2 

Two dice are rolled. What is the probability of getting a total of 4 given that 
both faces are similar. 

1/6 

Given P( A) = 0,60 and P( B ) = 0.10. If A and B are not mutually exclusive 
events, then find PC AUB). 

0.64 
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EXERCISES 


Q.1 Two coins are tossed once. Find the following probabilities: 


@ exactly two heads (ii) exactly two tails Gui) at least one head 
(iv) atmosttwoheads (v) at least one tail (vi) at most one tail 
(vii) equal number of heads and tails (viii) no head 


Ans: (i) 1/4 (ii) 1/4 (iii) 3/4 (iv) 1 (vy) 3/4 (vi) 3/4 (wii) 1/2 (viii) 1/4 


Q.2 Three coins are tossed. What is the probability of getting (i) exactly 2 heads, 
(ii) at most 2 heads (iii) at least 2 heads (iv) greater number of tails than heads, 
(v) equal number of heads and tails. 


Ans. (i) 3/8 Gi) 7/8 Gi) 1/2 Gv) 12 =) 0 
Q.3 Two six sided dice are rolled. What is the probability that (i) the numbers 


appearing on both dice are same (ii) the sum of the numbers on two dice is 8 
(iii) product of the numbers on two dice is 6. 


Ans.(i) 1/6 (ii) 5/36 (iii) 1/9 


Q.4 Playing with two balanced dice, what is the probability that (i) total on the two 
dice is 2 (ii) total is minimum or maximum (iii) both faces are alike (iy) total is 
a complete square (v) product of faces is a complete square (vi) there is at least 
one ace, 


Ans. (i) 1/36 Gi) 1/18 Gi) 1/6 (iv) 7/36 (v) 2/9 (vi) 11/36 


Q.5 Show that in a single throw with two dice, the chance of throwing more than 7 
is equal to that of throwing less than 7. 


Ans. 15/36, 15/36 


Q.6 An income tax officer has received 10 files numbered from 1 to 10. He selects 
just one file for inspection. Find the probability that : 


(i) the file number is multiple of 5 (ii) the file number is multiple of 3 
(iii) the file number is multiple of 5 or multiple of 3. 
Ans.(i) 1/5 Gi) 3/10 (iii) 1/2 


Q.7 A card is drawn at random from an ordinary deck of playing cards. Find the 
probability that (i) card is red, (ii) card is an ace, (iii) card is of diamonds, 
(iv) card is 2, 3, 4, 5 or 6, (v) card is a red queen. 


Ans.(i) 1/2 (i) 1/13 (iii) 1/4 (iv) 5/13 (v):1/26 


Q.8 If three cards are chosen from a well-shuffled deck of 52 playing cards, what is 
the probability of selecting 


(i) 3 hearts (ii) no heart (iii) 3 red cards (iv) no red card? 
Ans: (i) 0.0129 Gi) 0.4135 Gi) 0.1176 (iv) 0.1176 
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Q.9 A card is selected at random from a well shuffled deck of 52 playing cards. 
Consider the following events: 
Let A = Ace card B = Face card C = King card 
(i) Show that A and B are mutually exclusive events 
(ii) Show that A and C are equally likely events 
(iii) Are B and C mutually exclusive events? 
(iv) Are B and C equally likely events? 
Ans: (i) Mutually exclusive (ii) Equally likely (iii) No (iv) No 


Q.10 Let A denotes a card of heart, B a faced card. A and B are their complementary 
events. If a card is drawn at random from a full deck, find the following 
probabilities 


(i) P(A) Gi) PA MB) Gidi) P(AUB) (vy) P(ANB) (vy) P(A UB) 
Ans. (i) 1/4 (ii) 8/52 (iii) 11/26 (iv) 5/26 (v) 48/52 
Q.114 coin is tossed twice. Let A be the event that at least one tail appears, B be 
the event that at most one tail appears and C be the event that exactly two 
tails appear. 
(i) Are B and C mutually exclusive events? 
Gi) Are A and B equally likely events? 
Gii) Are B and C collectively exhaustive events? 
Ans: (i) Yes (i) Yes (iii) Yes 
Q.12A die is thrown. Consider the following events: 
Let A = even number occurs B = prime number occurs 
C = odd number occurs D = number is divisible by 5 
@ Are A and B equally likely events? 
(ii) Are A and B mutually exclusive events? 
(iii) Are A and C mutually exclusive events? 
(iv) Are A and C collectively exhaustive events? 
(v) Which of the above events are simple and which are compound? 
Ans: (i) Yes (ii) No (iii) Yes (iv) Yes 
(vy) Simple event = D and compound events = A, Band C. 
Q,13 Two dice are thrown once. The following events are defined: 
A = sum of spots is 6 B = sum of spots is 8 
C = same numbers appear D = different numbers appear 
(i) Are A and B mutually exclusive events? 
(ii) Are A and C mutually exclusive events? 
ii) Are A and B equally likely events? 
(iv) Are C and D collectively exhaustive events? 
Ans:(i) Yes (ii) No (iii) Yes (iv) Yes 
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Q.14A bag contains 4 white and 2 black balls. Two balls are selected at random 
without replacement. Find the following probabilities after making the sample 
space of all possible outcomes. 

(@ both are white (ii) both have the same colour (iii) both have different colours. 

Ans. (i) 2/5 Gi) 7/15 Giii) 8/15 

Q.15A bag contains 7 white, 5 black and 4 red balls. If two balls are drawn at 
random, find the probability that (i) both the balls are white (ii) one is black 
and other is red. 

Ans. (i) 7/40 Gi) 1/6 

Q.16A committee of 4 people is to be appointed from 3 officers of the production 
department, 4 officers of the purchase department, 2 officers of the sales 
department and 1 chartered accountant. Find the probability of forming the 
committee in the following manner: 

(i) There must be one from each category. 
Gi) It should have at least one from the purchase department. 
(iii) The chartered accountant must be in the committee. 

Ans, (i) 24/210 Gi) 13/14 = @ii) -:2/5 

Q.17 An urn contains 8 red, 3 white and 2 blue balls. If 3 balls are drawn at random, 
determine the probability that (i) all 3 are red (ii) all 3 are white (iii) 2 are red 
and 1 is blue (iv) at least one is white (v) the balls are drawn in the order, red, 
white and blue. 

Ans. (i) 28/143 (ii) 1/286 Gi) 28/143 (iv) 83/143 (vy) 4/143 

Q@.18 Six cards are drawn at random from a deck of 52 cards. What is the probability 
that 3 are red and 3 are black? 

Ans. 0.332 

Q.19 If the probability of a horse A winning a race is 1/5 and that of a horse B is 
1/6, what is the probability that one of them wins? 

Ans. 11/30 

Q.20 A coin and a die are thrown together. Find the chance of throwing a head and 5 
or a tail and 6. 

Ans. 1/6 

Q.21 What is the probability of throwing either 7 or more than 10 with two dice? 

Ans. 1/4 

Q.22 Two fair coins are tossed. Let A denote the event "both faces are heads" and B 
denote the event "only one face is head". Are A and B disjoint events. Evaluate 
P(A vu B). 

Ans. Yes, 3/4 

Q.23 An integer is chosen at random from the first 200 positive integers. What is the 
probability that the integer chosen is divisible by 6 or 8? 

Ans, 1/4 
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Q.24 The integers 1, 2, 3, ---, 20 are written on slips of paper which are placed in a 
bowl and thoroughly mixed. A slip is drawn. What is the probability that the 
number on the slip is: 

(i) either square or divisible by 3? Gi) either prime or odd number? 

Ans. (i) 9/20 (ii) 11/20 


Q.25 An integer is chosen at random from the first 100 positive integers, Find the 
probability that the chosen digit is 


(i) multiple of 10 (ii) divisible by 8 (iii) divisible by 8 or 12. 
Ans.(i) 1/10 (ii) 8/25 (iii) 4/25 
Q.26 A pair of dice is thrown, find the probability that (i) the sum of the digits on 
turned up faces is either 6 or 9 and (ii) it is either a doublet or less than 5. 
Ans. (i) 1/4 (ii) 5/18 
Q.27 If A and B be events with P(AUB)= 7/8, P(AQB)= 1/4 and P(A) = 6/8. 
Find P(A), P(B) and P(A - B). 
Ans. P(A) = 3/8, P(B) = 3/4, P(A - B) = 1/8 
Q.28 The following contingency table is set up: 


B B 
A 10 20 


A 20 40 


Compute the following probabilities 


(Gi) eventA (ii) event B (iii) event A 

(iv) event B (v) event A and B (vi) event A andB 

(vii) event A and B (viii) event A and B (ix) event AorB 

(x) event A orB (i) event Aor B (xii) event AorB 
Ans: (i) 1/3 Gi) 1/3 Gi) 273 (iv) 2/3 (v) 1/9 (wi) 2/9 

(vii) 2/9 (viii) 4/9 (ix) 5/9 (x) 7/9 (xi) 7/9 (xii) 8/9 


Q.29 Two coins are tossed. What is the conditional probability that the two heads 
result, given that there is at least one head? 
Ans. 1/3 


Q.30 A fair die is rolled. Let A be the event that the face is prime and B be the event 
that the face is even. Find the following probabilities. 


@P(AUB) Gi)P(AM B)  Giii) P(A/B) (iv) P(B/A)._ (v) P(A/B) 
Ans.(i) 5/6 Gi) 1/6 (ii) 1/3 (iv) 1/3 (v) 2/3 
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Q.31A pair of dice is thrown. If the two numbers appearing are different, find the 
probability that (i) the sum is 6 (ii) the sum is four or less. 

Ans. (i) 2/15 (ii) _ 2/15 

Q.32 card is taken at random from playing cards, find the probability that card is 
a king, when it is given ( known ) that the card is 
@ red (ii) of diamonds (iii) faced. 

Ans.(i) 1/13 Gi) 1/13 (iii) 1/3 

Q.33In a firm, 20 percent of the employees have accounting background, while 5 
percent of the employees are executives and have accounting background. If an 
employee has an accounting background, what is the probability that the 
employee is an executive? 

Ans. 1/4 

Q.34 The probability that a regularly scheduled flight departs on time is P(D) = 0.83, 
the probability that it arrives on time is P(A) = 0.92, and the probability that it 
departs and arrives in time is P( D > A ) = 0.78. Find the probability that a 
plane (i) arrives in time given that it departed on time, and (ii) departed on 
time given that it has arrived in time. 

Ans. (i) 0.94 (ii) 0.85 

Q.35 Given the following contingency table: 


What is the probability of 


@) A/B? Gi) A/B? ii) AB? (iv) A/B? 
Ans: (i) 2/3 (i) 4/7 (ii) 1/3 (iv) 3/7 
Q.36 Two dice are rolled. Let E, denote the event of an odd total, E, the event of an 
ace on the first die and E, the event of a total seven. 
Gi) Are E, and E, independent? (ii) Are E, and E, independent? 
(iii) Are E, and E, independent? 
Ans.(i) independent (ii) dependent (iii) independent. 


Q.37 The probability that a boy will pass an examination is 3/5 and that for a girl it 
is 2/5. What is the probability that at least one of them will pass the 
examination? 


Ans. 19/25 
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Q.38 Two urns contain respectively 3 white, 7 red, 15 black and 10 white, 6 red and 
9 black balls. One ball is drawn from each urn. What is the probability that 
both balls are of the same colour. 

Ans, 0.3312 

Q.39The probability that a 50 year old man will be alive at 60 is 0,83 and the 
probability that a 45 year old woman will be alive at 55 is 0.87, What is the 
probability that a man who is 50 and his wife who is 45 will be alive 10 years 
hence? Also find the probability that at least one of them will be alive 10 years 
hence. 

Ans. 0.7221, 0.9779 

Q.40 Three missiles are fired at a target. If the probabilities of hitting the target are 
0.4, 0.5 and 0.6, respectively and if the missiles are fired independently, what 
is the probability that at least two missiles will hit the target? 

Ans, 0.5 

Q.41 A can hit a target 4 times in 5 shots, B can hit 3 times in 4 shots and C can hit 
twice in 3 shots. They fire a volley. What is the probability that the two shots 
at least hit? 

Ans. 5/6 

Q.42.4 husband and a wife appear in an interview for two vacancies of the same 
post. The probability of husband's selection is 1/7 and that of wife's selection is 
1/5. What is the probability that (i) both of them will be selected Gi) only one of 
them will be selected (iii) none of them will be selected. 

Ans.(i) 1/35 Gi) 10/35 Gii) 24/35 

Q.43 A and B play 12 games of chess out of which 6 are won by A, 4 are won by B, 
and 2 end in a tie. They agree to play a tournament consisting of 3 games. Find 
the probability that (i) A wins all three games (ii) two games end in a tie (iii) A - 
and B win alternately (iv) B wins at least one game, 

Ans. (i) 1/8 (ji) 5/72 (iii) 5/36 (iv) 19/27 

Q.44 Two cards are drawn from a well-shuffled ordinary deck of 62 cards. Find the 
probability that they are beth aces if the first card is 
(i) replaced (ii) not replaced 

Ans. (i) 1/169 (ii) 1/221 

Q.46 Three balls are drawn successively from a box containing 6 red, 4 white and 5 
blue balls. Find the probability that they are drawn in the order, red, white 
and blue if each ball is (i) replaced (ii) not replaced. 

Ans.(i) 8/225 (ii) 4/91 

Q.46 Find the chance of drawing an ace, a king, a queen anda jack in order from an 
ordinary pack in four consecutive draws, the cards drawn being not replaced 

382 

Ans. 312175 
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Q.47 Urn A contains five red and three white balls. Urn B contains two red and six 
white balls. (i) If a ball is drawn from each urn, what is the probability that 
both are of the same colour. (ii) If two balls are drawn from each urn, what is 
the probability that all the four balls are of the same colour? 


Ans. (i) 7/16 (ii) 55/784 
Q.48 If A and B are independent events and P(A) = 0.30 and P(B) = 0.60, find 


(i) P(A/B) Gi) ANB) ii) PPAUB) (iv) PAB) 

Ans: (i) 0.30 ii) 0.18 (ii) 0.72 (iv) 0.28 

Q.49 Given P(A) = 0.60, P(B) = 0.40, P(AMB) = 0.24. Find P(A/B), P(B/A) and 
P(A U B). What is the relationship between A and B? 

Ans. P(A/B) = 0.6, P(B/A) = 0.4, P(AUB) = 0.76, A and B are independent 


Q.50 Assume that events A and B are independent. That is P(A/B) = P(A) and 
P(B/A) = P(B). You are given the probability assessments that P(A) = 0.45 
and P(B) = 0.80. Find: (i) P(AMB) Gi) P(A UB). 

Ans. (i) 0.36 (ii) 0.89 

Q.51A bag contains 5 blue and 3 black balls and another bag contains 3 blue and 5 
black balls. One bag is selected at random and a ball is taken out of it. Find the 
probability that ball is black. 


Ans. 1/2 
Q.52A problem in statistics is given to three students A, B and C whose chances of 


solving it are 1/3, 1/4 and 1/2 respectively. What is the probability that the 
problem will be solved? 


Ans. 3/4 

Q.53 A, B, C, D cut a p&ck of cards successively in the order mentioned. What are 
their respective chances of cutting a spade first ? 

Ans. P(A) = 64/175, P(B) = 48/175, P(C) = 36/175, P(D) = 27/175 

Q.54 A, B and C in order toss a coin. The first one to throw a head wins. What are 


their respective chances of winning? Assume that the game may continue 
indefinitely. 


Ans. P(A) = 4/7, P(B) = 2/7, P(C) = 1/7 


Chapter 


8 


RANDOM VARIABLE AND 
PROBABILITY DISTRIBUTIONS 


8.1. INTRODUCTION 


The word random is used quite commonly in our daily life. One may or may not 
know its meaning but whenever it is used, it conveys the sense for which it is used. 
The customers enter a shop not according to some preplanned manner, but they 
enter the shop in a random manner. The vehicles cross a zebra crossing in a random 
manner. The teacher does not check the note-books of all students, he checks some of 
the note-books selected at random. Somebody is intelligent by birth, somebody is 
healthy by birth, somebody faces an accident on the road, somebody gets an attack of 
influenza, All these events occur randomly. When ever a coin is tossed or a die is 
thrown, the results happen in a random manner. Many situations in practical life 
are of random nature. Their ultimate results are based on chance. Even a young kid 
is also familiar with the classical idea of "lottery method", which is centuries old and 
has been used for the selection of a random sample. Most modern methods of 
selecting a sample are based on the theory of random selection by lottery. The 
random sample is the basis of the statistical inference. Thus randomness is the 
central idea of the study which is carried out to know something about unknown 
situations. 

8.2. GENERATION OF RANDOM NUMBERS . 

In our counting system, there are ten basic digits which are used for counting 
purposes. These digits are 0, 1, 2, ... 9. We can make numbers of any size with the 
help of these digits, The figure 53792 is made up of five digits 2, 3, 5, 7 and 9. We 
shall use these digits to make a set of numbers called table of random numbers. 
Suppose we select ten paper slips and on each slip we write a different digit. Thus 
each slip represents a digit. We select any one of these slips at random and note 
down its digit on a paper. We return the slip to the main lot and select a slip again. 
The digit on the second slip is also noted along with the first digit (row-wise) or 
below the first digit (column-wise). We continue this process of selecting, recording 
and replacing each selected slip. On each selection the probability of selection of 
each digit is 1/10. Thus each digit has equal probability of selection. We get a set of 
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digits called random digits. If the first digit is 5, second is 7, third is 5 and fourth is 
0, we can write them in a row as 5750 or 57 50. We can also write in a column as 
below: 


aa 


a 


° 


When the first row or column is completed, we can write the selected digits in 
the second row or second column. In this manner a table of any size spread over a 
number of pages can be obtained. This is called table of random numbers, One small 
table of random numbers is given below: 


Table 8.1, 
51 22 09 12 72 12 40 92 
3 45 35 50 23 39 74 44 
5 7 18 73 31 11 75 88 75 
92 69 46 75 56 82 Tail 66 
38 32 12 93 95 68 84 87 
95 ine | 80 36 82 16 48 38 


This table is written with two digits in two columns together. This is one way of 
writing the digits. One can write 3-digit or 4-digit columns. Anybody can make a 
table of random numbers. A good table of random numbers contains 0, 1, 2, ..., 9 
almost equal number of times. The students shall learn in higher classes that the 
random numbers can be made for each probability distribution. The random 
numbers under discussion, infact are the random numbers from a discrete uniform 
distribution over the interval (0, 9). 


8.3. HISTORY OF RANDOM NUMBER TABLES 


In actual practice, we do not have to make the table of random numbers. These 
tables are given in the books and we can consult them. The first table of random 
numbers was constructed by Tippett in 1927. The digits in the logarithms of various 
integers were used to make this table. Fisher and Yates prepared a table of 15000 
numbers. Another table of random numbers was constructed by Smith and Kendall 
in 1939. This was prepared by revolving a wheel on a machine. The wheel was 
stopped at random. The wheel was divided into ten equal parts and each part 
indicated a digit. In 1955, the Rand Corporation published a book "A million random 
numbers". One page of this random number table is given in end of this book. The 
Rand Corporation prepared the random numbers with the help of computer. We can 
also get the random numbers fron. usc electronic calculators which have the function 
‘Ran’ (for random ) on them. 
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8.4, APPLICATION OF RANDOM NUMBERS 


There are many uses of random numbers in statistics. One of the important use 
is in the selection of a simple random sample from a finite population. Suppose there 
are 80 students in a class and we want to select 8 students at random. The students 
can be numbered from 01 to 80. Then we consult the random number table. Let us 
see Table 8.1. We shall read two-digit columns. If any number is between 01 and 80, 
we shall note it down for the sample. If any random number is above 80, we shall 
ignore it because it is not in our population. We can read the random number table 
from any place. We may move column-wise or row-wise. Let us read the random 
numbers from Table 8.1. We read the first column. The random numbers are 51, 72, 
57, 38, 22, 45, 18, 69. Two random numbers 92 and 95 have been ignored because 
they are above 80. The remaining cight random numbers represent those eight 
students who have been selected for the sample. If the number of units in the 
population are in hundreds, say 800 we shall read three-digit column of the random 
number table. If we have 100 units in the population, we shall number them from 00 
to 99 so that we have to read 2-digit columns. If we number them as 001, 002, .., 
100, we shall have to read 3-digit columns. In three digit column, most of the 
random numbers will be above 100 and a lot of time will be wasted in getting the 
required size of the sample. Further use of random number table for the selection of 
a simple random sample will be discussed in a chapter on sampling and sampling 
distributions. 


Random number table can also be used to generate data without performing 
the actual experiment. Suppose we want to toss a coin 10 times to see the number of 
heads. We can do it by 


G) tossing the coin and counting the number of heads. 

(ii) By using random number table. The even digits 0, 2, 4, 6, 8 will stand for 
the head and the odd digits 1, 3, 5, 7, 9 will be for the tail. The first ten digits of the 
first row of Table 8.1. are reproduced below. H is for head and T is for tail 
Digits: 5 1 2 2 0 Cee 2 7 2 
Outcomé: «) Divi aIbArgekey WEben aE nin MP-wh- SH i Ga och 

There are 5 heads and 6 tails, It is just a chance that number of heads is equal 
to the number of tails. If we read the next row, the result in general would be 
different. 


This process of getting the results of an experiment from random number table 
is called simulation. In simulation the actual experiment is not performed. 


Example 8.1. 

Two balanced coins are to be tossed 10 times to record the number of heads 
each time. Use random number ‘table to record the possible observations. Write the 
frequency distribution of the observed number of heads. 

Solution: 

Two digits of a random number table will represent the result of a throw of two 

coins. We shall take ten pairs of random numbers for 10 throws of two coins. 
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From Table 8.1. we take 10 pairs of random digits and count the number of 
heads. Even digit will indicate head (H) and an odd digit will indicate tail (T ) 


Random pairs: Ole 1225+ OOM W122 7220 A2 AO 5792 002°9]0.0995 
Number of even digits 0 2 1 1 1 1 2 t 1 0 
No. of heads: 0 2 1 1 1 Bi 2 a a 0 


The observed frequency distribution of number of heads is 


If two coins are actually tossed 10 times, the observed frequency distribution 
may or may not agree with the above distribution. 
Note: If 3 coins are to be tossed, we shall take 3 digits to represent a throw of 3 
coins. 
Example 8.2. 

Assume that a fair die is to be rolled ten times. Without rolling the die, obtain 
the possible outcomes using random digits. 
Solution: 

Here the six digits 1, 2, 3, 4, 5, 6 will represent the six faces of the die. We shall 
ignore the random digit 0 and the digits above 6. 

From Table 8.1. we have the random digits 

5 BwQeegt 2 Leuba Zo 

Thus we assume that the first throw has given 5 on the die, 2nd throw has 
given 1 on the die and the 10th throw has given 4 on the dic. 
Example 8.3. 

Assume that a die is to be tossed ten times. Use random digits to find the 
number of successes where success is attained through 1. 
Solution: 

From Table 8.1. we take the random digits as below. The digit 1 is for success 
(S) and digits 2, 3, 4, 5, 6 are for failure ( F ) and the digit 0 and the digits above 6 
will be ignored. 
Random digits: 5 1 2 2 1 2 2 1 2 4 
Result: Peg Si) on Bog tSen HS ao 

Thus in 10 throws, the success has been achieved 3 times. 
8.5. RANDOM VARIABLE 

A set of numerical values assigned to the all possible outcomes of the random 
experiment is called random variable. The random variable can be briefly written as 
r.v. If we write A, B, ... , F on the six faces of a die, these letters are not a r.v. If we 
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write some numerical values on the six faces of a die like 1, 2, 3, ... , 6, we have a set 
of values called r.v. Suppose we select two bulbs from a certain lot having good and 
defective bulbs. Let G stand for good and D stand for defective. There are four 
possible outcomes which are GG, GD, DG and DD. Each outcome can be assigned 
some numerical value. Let us count number of defective bulbs in each outcome. We 


can write 
No. of defective bulbs 


Thus the numerical values 0, 1, 2 are the values of the random variable where 
random variable is the number of defective bulbs in this discussion. A random 
variable is denoted by a capital letter X. Here X is the number of defective bulbs. 
The small letters x,, x2, ... , X, are used for the specific values of the random 
variable. A random variable is also called chance variable. If we have two or more 
than two random yariables we can use the letters X, Y, Z for them. A random 
variable may be discrete or continuous. 

8.6. DISCRETE RANDOM VARIABLE 

A random variable X is called discrete if it can assume finite or countably 
infinite number of values. If two bulbs are selected from a certain lot, the number of 
defective bulbs may be 0, 1 or 2. The range of the variable is from 0 to 2 and random 
variable can take some sclected values in this range. The number of defective bulbs 
cannot be 1.1 or —1 or 3 etc. This random variable can take only the specific values 
which are 0, 1 and 2. When two dice are rolled the total on the two dice will be 2, 3, 
++, 12. The total on the two dice is a discrete random variable. 


8.7. DISCRETE PROBABILITY DISTRIBUTION 


Suppose a discrete random variable X can assume the values X1) Xz, Xyy aoe y Xq 
with corresponding probabilities p(x,), P(X2), p(xs), -.. , p(x,). The set of ordered pairs 
[ x1 pO) ], [xy PG) ], [ x, pss) J, ... , [x PX, ] is called the probability 


distribution or probability function of random variable X. A probability distribution 
can be presented in a tabular form showing the values of the random variable X and 
the corresponding probabilities denoted by p(x) or f(x). The above information can 
be collected in the form of a table below called the probability distribution of the 
random variable X. The probability that random variable X will take the value x; is 
denoted by p(x;) where p(x;) = P(X = Xi) 


| Values of random variable ( x) | Xx X2 X3 We xj obs Xp | 


| Probability p(x) [ey pe) pow Pw Pa | 
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The probability of some interval can be calculated by adding the probabilities of 
all the points in the interval. For example P[ x; < X < x,] = P( X= x2) + P( X=x,). 


This type of addition will not be possible in continuous random variable for 
finding the probability of an interval. Therein we shall use the integral calculus for 
finding the probability of an interval. _ 


The probability distribution can also be described in the form of an equation for 
f(x,) with a list of possible values of the random variable X. Some probability 
distributions in the form of equations are 


@ = fx) = é for x= 1) :2):8, a6: 


For each value of X the probability is 1/6. It is the probability distribution when 
a fair die is rolled. 


Gi) fx) = (2) (4) x : 0, 1, 2, 3. 


x 
It is the probability distribution when three fair coins are tossed. 
8.8. PROPERTIES OF THE DISCRETE PROBABILITY FUNCTION 
A discrete probability distribution or function f(x) has the following properties. 
@ f(x) is always non-negative. It lies between 0 and 1. Thus 0< f(x) < 1 


n 
Gi) The sum of all the probabilities is 1. Thus _ 2 ie) =] 
i= 


8.9. GRAPH OF THE DISCRETE PROBABILITY FUNCTION 
The discrete probability distribution can be presented in the form of a graph by 
taking the values x, X2, Xg, «+. , Xj, --- on X-axis and drawing vertical lines of heights 
equal to f(x,), f(x2), f(x), ... , £(x)), ... above them. This type of graph is called line 
graph. 4 
f(x) 


0 x 
xi x2 ae wes Xi Xiv) jee" 


Fig.8.1. Line graph of the probability distribution. 
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8.9.1, PROBABILITY HISTOGRAM 


The probability distribution can also be presented in the form of probability 
histogram. 


(x) 


0 ‘i -~X 
xX Xa Xs ge, Xi Xi 
Fig.8.2, Probability histogram 


Example 8.4. 

A digit is selected from the first 8 natural numbers. Write the probability 
distribution of X where X is the number of factors (divisors ) of the digits. 
Solution: 

It is assumed here that the probability of selection for each digit is 1/8. We can 
write: 


Factors Number of factors | Probability 
ssi) 


The information in the above table can be summarised as below in the form of 
table of probability distribution, 


The probability of X = 2 is 4/8 which has been obtained by adding 1/8 four 
times. Similarly the probability of 4 is 2/8 which has been obtained by adding 1/8 
two times. 
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A factory is producing bulbs out of which 25 % are defective. Two bulbs are 
selected from this factory for inspection. Write the probability distribution of 


number of defective bulbs. 
Solution: 


Let X denote the number of defective bulbs. Let D denote the defective and G 


denote the good bulb. Here 


25 1 
PO) Pasig e ok and P(G) 


ch aa) 


100° 4 


It is assumed here that the two bulbs are selected independently because there 
are very large number of bulbs in the factory. According to multiplication law of 


independent events, we have 


P( both defective ) = 
P( first defective and second good ) = 
P( first good and second defective ) = 


P( both good ) = 


The possible outcomes, the random variable X and the 


P(D,D.) 
PD,G,) 
P(G,D.) 


P(G,G,) 


probabilities are put in the following table : 


il 


PO) PO) = ¢*> = A 
PD) PG) = 1,3 4 2 
PG) PO) = 34 pe + 
PG) PG) = 3x3 = a 


corresponding 


Outcomes (S) 


Number of defectives (r.v.) [ Probability | 
a 
D,D. 2 4 = 16 


The above information can be collected as below in the form of a probability 


distribution of the random variable X. 
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Example 8.6, 


From an urn containing 4 red and 6 white round marbles, a man draws three 
marbles at random without replacement. If X is a random variable which denotes 
the number of red marbles drawn, what is the probability distribution of X? Draw a 
probability histogram. 


Solution; 
Red marbles White marbles Total marbles 
4 6 10 
S contains ( re = 120 sample points. 


If the random variable X denotes the number of red marbles, the possible 
values of x are 0, 1, 2, and 3, and their respective probabilities are: 


pA rag a, FL Pe mONG TT = 
(7) Gi 
4\/6 4)(6 

suet ae me LOM EY oe ag 
5) 3 


The probability distribution of red marbles in a tabular form is: 


x 0 1 2 3 
P(x) 5/30 15/30 9/30 1/30 


The probability distribution is shown as a histogram in Fig.8.3, 
p(x) 


0 1 
Fig.8.3. Number of red marbles. 


tw 
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Example 8.7, 
Given the discrete probability distribution: 


p(x) = (ar GS" for x= 0, 1, 2, 3, 4. 


Find the complete probability distribution and draw suitable graphs. 


Solution: 
4)\(1 1\4-x 
p(x) = (4)(8) (GG) x=0,1,2,3,4 


pcx=2) = ($)(3) (4) “% 7x=3) = (4)( y(A)-4 
nxen = (JOY) =4 


The probability distribution of X in a tabular form is: 


x 0 1 2 3 4 
p(x) V16 4/16 6/16 4/16 1/16 


p(x) Line graph of probability P(X) Probability histogram 
distribution of heads. 


6/16 


0} ee 3 oe fer Vacwe os: ~ 4 
Fig.8.4. (a) Number of heads * Fig.8.4.(b) Number of heads 
Example 8.8. 


A random variable X has the following probability distribution: 


Find: (i) k Gi) P(X<2) ii) (X22) Gy) P(-2<X<2) (v)P(X<1). 
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Solution: 
(i) Since the sum of probabilities is one 


3 
2 BWs 1 gives 


O.1+k+0.2+2k+0.3+3k=1 or 0.6+6k=1 or 6k=1-0.6=0.4 


0.4 ee arg 
so that k= 6 = 60 =) 15 


Gi) P(X<2) = P(X=~-2)+P(X=-1)+P(X=0)+P(X=1) 
= O.1+k+0.2+2k = 03+3k = 0.8+3(3¢) =0a+02=05 
(ii) P(X22) = P(X=2)+P(X=38)=0,3+3k 


1 = = 
= 03+3(7) = 0,340.2 =05 


(iv) P(-2<X<2) = P(X=-1)+P(X=0)+P(X=1)=k+0.2+2k=0.2+9k 
= o2+3(z5) =0.2+0.2=0.4 : 
() P(Xs1) = P(X=-2)+P(X=-1)+P(X=0)+P(X=1) 


= O.1+k+02+2k=03+9k=08+8( 75) =0.3+0.2=0.5 


8.10. CONTINUOUS RANDOM VARIABLE 

A random variable is called continuous if it can assume all possible values in 
the possible range of the random variable. Suppose the temperature in a certain city 
in the month of June in the past many years has always been between 35° to 45° 
centigrade. The temperature can take any value between the range 35° to 45°. The 
temperature on any day may be 40.15°C or 40.16°C or it may take any value 
between 40.15° and 40.16°. When we say that the temperature is 40°C, it means 
that the temperature lies somewhere between 39.5° to 40.5°. Any observation which 
is taken falls in an interval. There is nothing like an exact observation in continuous 
variable. In discrete random variable the values of the variable are exact like 10} .1, 2 
good bulbs. In continuous random variable the value of the variable is never an 
exact point. It is always in the form of an interval, the interval may be very very 
small, 

Some examples of the continuous random variable are 
(i) The computer time (in seconds) required to process a certain program 
(ii) The time that a poultry bird will gain the weight of 1.5 kg. 
(iii) The amount of rain fall in a certain city. 
(iv) The amount of water passing through a pipe connected with a high level 

reservoir. 
(v) The heat gained by a ceiling fan when it has worked for one hour. 
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8.11. PROBABILITY DENSITY FUNCTION 


The probability function of the continuous random variable is called probability 
density function or briefly p.d.f. It is denoted by f(x) where f(x) is the probability that 
the random variable X takes the value between x and x + Ax where Ax is a very very 
small change in X. 


If there are two points ‘a’ and 'b' then f(x) 
the probability that the random 

variable will take the value between a 

and b is given by the integral 


b 
P(asXsb) = f t&) d& fix) 
a 
where 'a' and 'b' are any points between o Me Bae * 
—o and + ». The quantity f(x) dx is Fig. Z i 


called probability differential. 


The number of possible outcomes of a continuous random variable is 
uncountably infinite. Therefore, a probability of zero is assigned to each point of the 
random variable. Thus P( X = x ) = 0 for all values of X. This means that we must 
calculate a probability for a continuous random variable over an interval and not for 
any particular point. This probability can be interpreted as an area under the graph 
between the interval from a to b. When we say that the probability is zero that a 
continuous random variable assumes a specific value, we do not necessarily mean 
that a particular value cannot occur. We, in fact, mean that the point (event) is one 
of an infinite number of possible outcomes. Whenever we have to find the probability 
of some interval of the continuous random variable, we can use any one of these two 
methods. 


(i) Integral calculus 


(ii) Area by geometrical diagrams ( this method is easy to apply when f(x) is a 
simple linear function ). 


8.12. PROPERTIES OF PROBABILITY DENSITY FUNCTION 
The probability density function f(x) must have the following properties. 
(i) It is non-negative ie. f(x) 2 0 for all x. 


) 
Sf) dx = 1 


-o 


Gi) Total Area 


c 
dj f(x) dx = 0, where c is any constant. 
c 


(ii) P(X=c) 


W 
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(iv) As the probability is zero for X =c (constant), therefore P(X =a) =P(X=b)=0. 
If we take an interval a to b, it makes no difference whether end points of the 
interval are considered or not. Thus we can write: 


P(asXsb) = P(a<X<b)= P(asX<b)= P(a<X<b) 


(v) P(asXsb) = FM) de = ff) ax f f(x) dx (a<b) 
a —-@ -o 
Example 8.9. 
Given a function f(x)=cx, Osx<2 
(a) Find the value of ¢ so that f(x) is a probability density function. 
(b) Find the probabilities ( B(X<1) i) P(F<x<3) 


Solution: 


a) f(x) will be a proper p.d.f. if the total area under the line and X-axis from 0 to 2 
is unity. The graph of f(x) is a right angled triangle. Its area is given by 


5 ie fix 
Aes ¢ cere x base x) 
2 
height =f(x)=cx =2c at X = 2 
base = 2-0=2 
Total Area has to be unity, therefore 
ge x2 
Area = en tae x 
i 1 2 
2c = 1, sothat c= 2 Fig.8.6.(a) 


1 
Thus f(x) = 2* 0x2 is proper probability density function. 
(b) G) P(X <1) is the shaded area of the right-angled triangle 
yt f(x) 
ee height x base x 


1 
base = 1-0 = re 
x 


1 2 


Fig.8 6.10) 
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The probability P ( <X< 3) is the area of the shaded portion of the 
diagram. The shaded area is a trapezoid and its area is given by 
Area of Trapezoid = (Average height ) (Base ) 

Sum of parallel sides —_(*) 


Average Height = 2 
3 1 

7 (3)+«(3) 

Ph 2 

_ g4+14_ 1 

didenb. Simewinie Zl 

: x 
Base = g.3 = v 12 aio)! 9 
Fig.8.6.(c) 
p(3<x<8) = AreaofTrapezoid = I/2x1= 1/2 


Example 8.10, 


A continuous random variable X which can assume values between x = 2 and 8 


inclusive, has a density function given by c (x + 3) where c is a constant. 


(a) Calculatec. (b)P(3<X<5) (©) P(X24). 


Solution: 


(a) 


(b) 


f(x) =c(xt+3),2<x<8 


@ 
f(x) will be a density function if (i) f(x) 2 0 for every x and (ii) S f(x) dx = 1. 
-2 

If c = 0, f(x) is clearly 2 0 for every x in the given interval. Hence for f(x) to be a 
density function, we have 

cs 8 ‘x2 8 

f #0) dx = f c(x+3)dx = [e+3x], 

-2 2 
2 ‘Q)2 

[2+ a@-S*" 3] = c[32+24-2-6]=c [48] 
sothat c = 1/48 
Therefore, f(x) = vA (x+8), 2<x<8 


1 


i] 


P(38<X<5) 


W 

Bl Ble ee 
Br 

e 

” 

a 

wo 

S 

& 

i} 
ae 
= 

to| 

S 

z 
we 


GY +3¢)-S_a@)| - Bet 15-$-9] 


[ah = 
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Sal: 1 

J 4g (%+3) dx= 4[£ +3 
4 4 
oh. 


Wt 


(ce) P(X24) 


. we +3@-9 ay -a)] = ¢ gp [82+ 24-8-12] 


= # [36] = 3 


Example 8.11. 
(a) Acontinuous random variable X has a density function f(x) = 2x when 0 <x <1 
and zero otherwise. Find (i) P (x <}) (ii) r(i <X< i) 


(b) If f(x) has probability density kx?, 0 < x < 1, determine k and find the 
probability that 1/3 < X < 1/2, 


Solution: 
(a) f(x) = 2x,0<xs1, 
= 0, otherwise. 
1 v2 1/2 ae 1) A 
(i) p(x<3) = f #@ ax = if xx = 2[ 2 = (4) -0| ag 


0 


y (2 Dye ae 12 x2]? 
(ii) ~P 1<X<3) = f %) dx f xa 2| =| 

v4 > V4 

[(aY-GYT = Gea) = a 

DB) TN Nig Si a Spats 
(b) f(x) will be a probability density function, if f f(x) dx = 1, ie. 
7° 
1 i i 

f f) dx = plat ax=k| | = [3 | = E | go that 


0 


k = 3. Hence the probability density function f(x) = 3x2, 0<x<1 
Now 


i i 
p(}<x<3] 


V2 i 1a a al ha 
i Kaos fe Se GENE 3 Jus 
U3 V3 


(@-Ol-Bal- & 
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8.13. MATHEMATICAL EXPECTATION 


Suppose a random variable X takes the n values as x), xX», Xj, ...» , X, With 
corresponding probabilities f(x,), f(x), f(x), ... , f(x,).The mathematical expectation 
or expected value of X denoted by E(X) is defined as 


E(X) 


n 
X1 (xq) + Xp fxg) + xq (xg) +... + Xp LX) = ey x; £(x)) 


i] 


Similarly E(X?) xe f(x,) + ~ f(x.) + xe £(X3) +... + xt f(x,) == x} £(x,) 


It is also called the mean of the discrete random variable X. If the random 
experiment is repeated a large number of times, most of the random experiments 
would generate the result equal to the expected value of X. 


8.14, FUNCTION OF A RANDOM VARIABLE 


When Y = aX + b, it is called linear transformation. The random variable Y is 
function of the random variable X. Expected valuc of Y can be written in terms of the 
expected value of X. 


Thus E(Y) = Ef[aX+b] = aE(X)+b 


It is important to note that the probability function of random variable Y will 
be the same as the probability function of random variable X. Thus f(x) = f(y). 


8.15. LAWS OF EXPECTATION 
G) E(Constant) = Constant 


If a constant 'c' is written on all the faces of the die, we shall always get c 
whenever die is rolled. Thus E(c) = ¢ 


Gi) E(aX) = aK(X) ax0 

@ii) E(X+a) = E(X)+a 

(iv) E(aX+b) = aE(X)+b. a0 

(vy) E(X+Y) = E(X)+E(Y) 

(vi) E(X-Y) = E(X)-E(Y) 

(vii) EC XY) = E(X)E(Y) if X and Y are indepencent. 


(viii) E[[X-E(X)] = E(X)-E(X) = 
Example 8.12. 
Let X have the following probability distribution: 


Cis 


(i) Find E( X), E( X?) and E( X+4) 
(ii) Verify that E( 2X + 3)=2F(X)+3 
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Solution: 
The necessary calculations are given below: 


(i) Se = Su 
E(.X*) =x? p(x) 


E(X +4) = 2(x+4) p(x) 


(ii) E( 2X +3)= 3 (2x+3) p(x) = 9.6 
2E(X)+3= 2(3.3)+3 = 66+3= 9.6 
Hence, E( 2X +3) = 2E(X)+3 
Example 8.13. 
Two unbiased dice are thrown. Find the expected value of the sum of numbers 
of points on them. 
Solution: 


Clearly X may be at least 2 and at the most 12. If x, x9, Xs, +.) X;, are the values 
of X with probabilities p,, pz, ps, .--, pr respectively corresponding to X = 2, 3, 4, ..., 
12, then the probabilities with variate-values may be tabulated as below: 


E( X) = =x p(x) = 252/36 = 
Example 8.14. 

From an urn containing 3 red and 2 white balls, a man is to draw 2 balls at 
random without replacement, being promised Rs.20 for each red ball he draws, and 
Rs.10 for each white one. Find his expectation. 

Solution: 
Red balls White balls Total balls 
urn: 3 2 5 


5 
S contains ( ) = 10 sample points 


Let x; = tworedballs = Rs.40 
xX, = one red and one white ball = Rs.30 
X; = twowhite balls = Rs.20 
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The respective probabilities are: 


p(x,) = P( two red balls ) 4 “(ey -2 
2 
3)(2 
p(x,) = P(one red and one white bal 1) = ty = < 
2 
3\(2 
p(x;) = P( two white balls ) = ty = 6 
2 
E(X)= x, p(x) +x, p(x) + xa pe) = ao(3 +30 §)+20(35) 
+= 12+18+2 = Rs.32 


Example 8.15. 

A and B throw one die for a prize of Rs.77 which is to be won by the player who 
first throws 3. If A has first throw what are their respective expectations? 
Solution: 


Here, p = 2 and q = 1-p = 6 

Ahas the first throw. Therefore A can win in the first trial. If A does not win on 
the first trial, the second trial will go to B. If B does not win on the second trial, the 
third trial will go to A and so on. Thus A can win on Ist, 3rd, 5th ... trials and B can 
win on 2nd, 4th, 6th ... trials. 
Trials: 1 3 
Chances of A: p pq? 

1 1 5 \e 1 

Chances of A: (3) (2)(3) (3 


A may win in any of the trials given to him. 


SSO ROURUOE 


7 
4 pq® 


a) (e)(8y 
Pt isan 


mie ; bes : 1 5\2 _ 25 
This is a geometric progression in which a = 6 and r= 6) = 36 
a 1/6 1/6 6 
FCA) = aun Serer manne = eniaae se 11 
Trials: 2 4 6 8 
Chances of B: pq pq? pq pq’ 


comers (18) (EY (EY 


B may win in any of the trials given to him. 


(5)(¢) 


Pp 
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vay = ()(8)+(aXe)+(ele)* Cele) + tom 


This is also a geometric progression in which 


165 5 5) 25 
a= (Z)G)- & 2 + (GS-8 


a 5/36 6/86 
P(B) = Sum = 7; = 7-25/36 = 11/36 = 11 


Probability of B can also be obtained from the equation P(A) + P(B) = 1 
Thus P(B) = 1-P(A) = 1-6/11 = 5/11 
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Sothat E(A) = 77x 6/11 = Rs42 and E(B) = 77x 6/11 = Rs.35 


Example 8.16. 


In a summer season, a dealer of desert room coolers can earn Rs.800 per day if 
the day is hot and can earn Rs.300 per day if it is fair and loses Rs.40 per day if it is 
cloudy. Find his mathematical expectation if the probability of the day being hot is 


0.50 and for being cloudy it is 0.30. 


‘Solution: 


Let the random variable X denote the number of rupees the dealer earns. Then 
the possible values of x are 800, 300 and —40, where — 40 corresponds to the fact that 


dealer loses, and the respective probabilities are 0.50, 0.20 and 0.30. Therefore 

E(X) = 800 (0.50) + 300(0.20) — 40(0.30) = 400+60-12= Rs.448 
8.16. VARIANCE AND STANDARD DEVIATION 

By definition § 

Var (X) = o? = E[X-E(X)]* = E(X*)-[E(X))? 

and S.D(X) = VE(X?)-[E(X)} 
Example 8,17. 

A random variable X has the probability distribution: 


t= k(5) x =0,1,2,3,4,5. 


(a) Determine the value of k. (b) Determine the expected value of X. 


(c) Determine the variance of X. (d) Show that Var( 4X + 9) = 16Var(X). 
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Solution: 


(a) Since the sum of probabilities is one. 


: f( 
Ef 


l(c) +(r)+(2)+( 


= 1 gives 


+ 


3)+( 


i)+(5)) 


1 


k[1+5+10+10+5+1] = 1 or k[32] = 1, sothat k = 1/32 


33 (3) 


Thus, f(x) = 


x = 0,1, 2,3, 4,5. 


x? f(x) (4x + 9)? f(x) 
ty) 0 9 9/32 81/32 
5/32 5/32 13 65/32 845/32 
20/32 40/32 17 170/32 2890/32 
30/32 90/32 21 210/32 4410/32 
20/32 80/32 25 125/32 3125/32 
5/32 25/32 [ 29 sn 29/32 841/32 
| Total 80/32 | 2107 [es Gos/s2 | 12192/32 
(b) E(X) = Ixf(x)= 80 Shiels 
E( X?) = &x? f(x) = = 7.5 
(c) Var(X) = E(X?)-[E(X)]* = 7.5-(2.5)? = 1.25 
E(4X+9) = 3(4x+9) f(x) = 808 19 
E(4X+9)? = D(4x+9)? f(x) = — 
(a) Var(4X+9) = E(4X+9)*-[E(4X+9)]* =381—(19)?= 20 
16Var(X) = 16(1.25) = 20 


Hence, Var( 4X +9) = 16 Var(X) 
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SHORT DEFINITIONS 
NS a 2 i i alt 


Random Variable 
A random variable is a numerical quantity whose value depends on chance. 

or 
A rule that assigns a numerical value to each outcome of a sample space is called 
random variable. 
Probability Distribution 
A probability distribution gives the probability for each value of the random 
variable. : 

or 
A list containing both the values of the random variable and their corresponding 
probabilities is called the probability distribution of the random variable, 
Discrete Sample Space 
A sample space is called a discrete sample space if it contains a finite or countable 
number of sample points. 
Discrete Random Variable 
A discrete random variable is one which can assume any set of possible values which 
can be counted or listed. 

or 
When the values of the random variable are countable, it is called a discrete random 
variable. 
Discrete Probability Distribution 
A probability distribution is called a discrete probability distribution if it refers to a 
discrete sample space. 
Probability Function 
A function denoted by f(x) that can provide the probability for each value of the 
discrete random variable is called a probability function. 
Properties of the Discrete Probability Function 
The probability function for a discrete random variable must have the following two 
properties: 
@) The range of f(x) is the set of real numbers from 0 to 1, inclusive. That is 

O< f(x) <1. 
(ii) The sum of the probabilities assigned to the values of a random variable must 
be equal to one, that is, Zf(x) = 1, 

Continuous Sample Space 
If the sample space of an experiment is the real line or intervals on the real line, we 
call it continuous. 
Continuous Random Variable 
A continuous random variable is one which can assume an infinite spectrum of 
different values across an interval and which cannot be counted or listed. 

or f 
A variable that can assume any possible value between two points is called a 
continuous random variable. 
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Probability Density Function 
A probability density function is a formula, or equation, used to represent the 
probability distribution of a continuous random variable. 
or 
The function that defines the probability distribution of a continuous random 
variable is called probability density function. 
Mathematical Expectation or Expected Value of X 
The expected value of a discrete random variable X is equal to the sum of the 
products of each value of x and the corresponding value p(x) i.e; E(X) = =x p(x) 
or 

The expected value or mean of a random variable X is defined as follows: 
(i) If X is a discrete random variable and f(x) is the probability of x, then the 

expected value of this random variable is E(X) = Ex f(x). 
(i) If X is a continuous random variable and f(x) is the value of its probability 

density at x, then the expected value of this random variable is 

E(X)= j x f(x) dx. 

=) 
MULTIPLE - CHOICE QUESTIONS 
7. If in a table all possible values of a random variable are given their 
corresponding probabilities, then this table is called as: 


(a) probability density function (b) distribution function 
(c) probability distribution (@) continuous distribution 

2. Avvariable that can assume any possible value between two points is called: 
(a) discrete random variable (b) continuous random variable 
(c) discrete sample space (d) random variable 


3. A formula or equation used to represent the probability distribution of a 
continuous random variable is called: 2 
(a) probability distribution (b) distribution function 
(c) probability density function (a) mathematical expectation 

4, If X is a discrete random variable and f(x) is the probability of X, then the 
expected value of this random variable is equal to: 


(a) ZXf(x) (b) C[x+f(x)] 
(c) X£(x) +x (d) Xxf(x) 

5. Given E(X) =5 and E(Y) =- 2, then E(X -Y) is: 
(a) 3 ) 5 
() 7 (d) -2 

6. Given x = 2 and f(x) = 0.5. If y = 2x - 3, then f(y) is equal to: 
(a) 1 (b) 0.5 


(ce) “= 2 @ oO 
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7. | Which of the following is not possible in probability distribution? 


(a) p(x) 20 (b) Lp(x) =1 

(c) Xxp(x) =2 (d) p(x) =-0.5 
8. Ife is a constant (non-random variable), then E(C) is: 

(a) 0 (by) 1 

(c) cf(c) (dd) c 
9. A discrete probability distribution may be represented by: 

(a) table (b) graph 

(c) mathematical equation (d) all of the above 
10, A probability density function may be represented by: 

(a) table (b) graph 

(c) mathematical equation (d) both (b) and (c) 


11. If C is a constant in a continuous probability distribution, then p(x = C) is 
always equal to: 


(a) zero (b) one 
(c) negative (d) impossible 5 
12, E[X -E(X)] is equal to: 
(a) E(X) (b) Var(X) 
(©) 0 (@) E(X)-x 
18, E[X -E(X)]? is: 
(a) E(X) (b) E(X?) 
(c) Var(X) (d) S.D.X) 
14. If the random variable takes negative values, then the negative values will 
have: 
(a) positive probabilities (b) negative probabilities 
(c) constant probabilities (d) difficult to tell 
15. Ifwe have f(x) = 2x, o< x < 1,then f(x) is a: 
(a) probability distribution (b) probability density function 
(c) distribution function (d) continuous random variable 
16. Numbers selected by a random process and are equally distributed in a table 
are called: 
(a) attributes (b) random variables 
(c) random numbers (d) quantitative variables 


ai. J f(x) dx is always equal to: 
-0« 
(a) zero (b) one 
(c) E(X) (d) f(x) +1 
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18. 


19. 


20. 


21. 


22. 


23. 


24, 


25. 


26. 


Basic Statistics Part-I 


A listing of all the outcomes of an experiment and the probability associated 
with each outcome is called: 


(a) probability distribution (b) probability density function 
(c) attributes (d) distribution function 


A quantity resulting from an experiment that, by chance, can assume different 
values is called: 

(a) random experiment (b) random sample 

(c) random variable (d) random process 

Which one is not an example of random experiment? 

(a) Acoin is tossed and the outcome is either a head or a tail 

(b) A-six-sided die is rolled 


(c) Some number of persons will be admitted to a hospital emergency room 
during any hour. 


(d) All medical insurance claims received by a company in a given year. 
A set of numerical values assigned to a sample space is called: 

(a) random sample (b) random variable 

(c) random numbers (d) random experiment 


A variable which can assume finite or countably infinite number of values, is 
known as: 


(a) continuous (b) discrete 
(c) qualitative (d) none of them 
The probability function of a random variable is defined as: 


x 

f(x) 
Then k is equal to: 
(a) zero (b) 1/4 
(c) 1/15 (dy one 
If f(x) = 1/10, x= 10, then K(X) is: 
(a) zero (b) 6/8 
@ 1 (@) -1 
If Var(X) =5 and Var(Y) = 10, then Var( 2X +Y ) is: 
(a) 15 (b) 20 
(c) 10 (d) 30 
A discrete probability function f(x) is always: 
(a) non-negative (b) negative 


(c) one (d) zero 
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27. 


28, 


29. 


30. 


31. 


32. 


33. 


84, 


In a discrete probability distribution the sum of all the probabilities is always 
equal to: 


(a) zero (b) one 

(c) minimum (d) maximum 

The suitable graph of probability function of a discrete random variable is: 
(a) curve (b) polygon 

(c) probability histogram (d) histovigram 

The appropriate graph,of probability density function is. 

(a) curve ? (b) histogram 

(c) polygon (d) none of them 


A variable which can assume all values in the range of a random variable, is 
called: 

(a) finite (b) infinite 

(c) continuous (d) discrete 

Total area under the curve of a continuous probability density function is 
always equal to: 


(a) zero (b) one 

(c) -1 (d) none of them 

An expected value of a random variable is equal to its: 

(a) variance (b) standard deviation 
(c) mean (d) covariance 


The probability of a continuous random variable "X" taking any particular 
value, k is always: 


(a) negative (b) zero 
(c) one (d) none of them 
Area of a trapezoid is equal to: 
E 

(a) =e endicular ee (b) length x breadth 

§ i bs 
© sum of parallel sides x base (ie aoneattien 
Var(4X + 8) is: 
(a) 12 Var(X) (b) 4 Var(X) +8 
(c) 16 Var(X) (d) 16 Var(X) +8 
Var(X) is equal to: 
(a) E(x?) (b) [E(X)]’ 
() E(x’) - [E(X)]’ (@) E(X*)+[B(X)]}? 
The expectation of the sum of two random variables X and Y is equal to: 
(a) E(X) E(Y) (b) E(X) + E(Y) 


(c) EC X#Y) (d) E(XY) 
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88. The expectation of the product of two independent variables X and Y is equal to: 
(a) E(X) E(Y) (b) E(X) +E(Y) 
(c) E( X+Y) (d) none of the above 
89. When the random variable X and Y are independent, its co-variance is: 
(a) one (b) negative 
(c) zero (d) positive 
40, A discrete probability function f(x) is always non-negative and always lies 
between: 
(a) O anda (b) Oand1 
(c) -—land+1 (d) -atot+o 
41. The probability density function p(x) cannot exceed: 
(a) zero (b) one 
(c) mean (d) infinity 
42, The height of persons in a country is a random variable of the type: 
(a) discrete random variable (b) continuous random variable 
(c) both (a) and (b) (d) neither (a) and (b) 
43. Arandom variable is also called: 
(a) chance variable (b) stochastic variable 
(c) constant (d) both (a) and (b) 
44. The distribution function F(x) is equal to: 
(a) P(X=x) (b) P(X<x) 
(ce) P(X2x) (d) all of the above 
l ©[2 ®[3 ©]4 @/5 O16 o®|7 @/ 8. @ 
9% @ 10. @/1. @ [12 © /13. ©1114 (@[15. &) /16. © 
17. (b) | 18. @) [19 © | 20. @ | 21. © [22 © [23. © | 24. © 
25. (d) | 26. (a) [ 27. ()[28. (| 29. (a) | 30. (c) | 31. () | 32. () 
33. (b) | 34. (c) | 35. (©) | 86. (©) | 87. (b) | 38. (a) | 39. (©) | 40. (b) 
41. (b) | 42. (b) | 43. (d) | 44.) ! 


SHORT QUESTIONS 


Q.1 


Q2 
Q3 
Q4 
Q.5 


What is a random variable? 


Describe the two different types of random variable. 

Define a continuous random variable. 

What is the sample space of a random experiment? 

Define a probability distribution for a discrete random variable. 
or 

What is meant by probability distribution? 
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Q.6 
Q.7 
Q.8 
Q.9 
Q.10 


Explain the probability function. 

Write short note on discrete random variable. 

Present the graph of discrete probability function. 

Define the term discrete probability function. 

What are the properties associated with a discrete probability distribution? 
or 

Write down the properties of discrete probability distribution. 

What is meant by mathematical expectation of a random variable? 

Write down the laws of mathematical expectation. 
or 

Write down the properties of expected values, 

Define a random variable and its expectation. 

Which formulas should be used to calculate the mean, variance and standard 

deviation of discrete probability distribution? 

What is a probability density function? 

Write down the properties of probability density function. 

Distinguish between discrete and continuous random variables giving 

examples, + 

What do you mean by expécted value? 

What does “ p.d.f” stand for? 

What are the expectation and standard deviation of a constant? 

E(C) = C and S.D.(C) =0 

Explain the probability mass function, 

Write down the formula for the probability distribution of the sum of spots 

when a pair of dice is thrown. 


ilitins 
pox) = STL 3 8,8, 4, 12, 
Explain the application of random numbers. 
Which methods are used in drawing random numbers? 
Given f(x) = a x= 1, 2, 3, 4. Show that f(x) is a probability function, 


Ef(x) = 1 
Given x = 0, 1, 2 and p(x) = 5/8, 4/8, 1/8, Is this probability distribution? 
No because Dp(x) = 1.25 


Given f(x) = x x= 1, 2,3. Findk, 
6/11 
Given p(x) = x (4) and x = 0, 1, 2, 3, 4, Find the value of k. 


1/16 4, 
Given x = 10, ~ 20, 30 and p(x) = 2k, 8k, 5k. Find P(X < 30), 
0.5 
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Q.30 Given x =— 3, — 2, 2, 3 and p(x) =¢, 4c, 3c, 2c. Find P(X >—3). 

Ans. 0.9 

Q.31 Given x = —3, -1, 0, 1, 2, 5 and p(x) = 01, ¢ 0.2, 2c, 0.3, 3c. 
Find P(-1<X<+1). 

Ans. 0.4 


Q.82 What would be the probability distribution for the outcomes associated with 
the throw of a fair six-sided die? 


Anis.\[ 0 3eF fate [irae a] 37 | ea] 
[ec [ive [ive [v6 | v6 | ve | 6 


Q.33 Given x = 1, 2, 3,4, 5 and p(x) = 1/10, 3/10, p, 2/10, 1/10. Find the value of p. 

Ans. 3/10 

Q.34 Three coins are tossed if x = number of tails, then make the probability 
distribution of x. 


Ans. 


ae a a ee ee ee ee ee 
p(x) 1/8 3/8 3/8 1/8 1 


Q.35 Two coins are tossed, if x = number of heads, then make the probability 
distribution of x. 


TaN 


Q.36 Given Head : Tail = 4; 2. Find P( Head ) and P( Tail ). 
Ans. P(Head) = 2/3 and P(Tail) = 1/3 


Q.37 A pair of dice is thrown, find the expected value of the number of sixes after 
making its probability distribution. 
Ans. 1/3 


Ans. 


2x+3 om i eee 
aoe Whether this is a probability distribution 


Q.38 Given x =-1, 0, 1 and p(x) = 
or not? 

Ans. No because Yp(x) = 2.25 

Q.39 Given x = 2, 4, 6 and p(x) = 0.3, Y, 0.2. Find the value of Y. 

Ans. 0.5 

Q.40 Given x=-1, 0, 1, 2 and p(x) =k|x- 2]. Find P(X =0). 

Ans. 1/3 

Q.41 Given x= 0, 1, 2, and p(x) = 5/40, 25/40, 10/40. Find E( X). 

Ans, 1.125 

Q.42 Given f(x) = * x= 1, 2. Find E( X ). 

Ans. 1.2 
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Q.43 


Ans. 
Q.44 


Ans. 
Q.45 


Ans, 
Q.46 
Ans. 
Q.47 
Ans. 
Q.48 


Ans. 
Q.49 
Ans, 
Q.50 


Ans, 


Q.51 
Ans. 


Q.52 


Ans. 
Q.53 
Ans. 


Q.54 
Ans. 


Q.55 


Ans, 
Q.56 
Ans. 


Q.57 
Ans. 


Let the random variable X have the probability function: 
+1)2 
f(x) = Ut De x=-1, 0, 1. Compute E( X ). 


zero 
What is the mathematical expectation if we stand to win Rs. 10 if head falls 
on a balanced coin? 

Rs.5 

What is the mathematical expectation to win Rs. 50 if a head appears and 
lose Rs. 50 if a tail appears? 

zero 

Given x = 1, 2, 3 and p(x) =a, b, a where a and b are constants. Find E(X) 

2 

Given x = -1, -2, 0, 1, 2 and p(x) =k, 2k, 3k, 4k, 5k. Find K(X). 

0.6 

If a pair of fair dice is rolled and X denotes the sum of the numbers on them, 
then find the expected value of X. 

" 

A die is rolled if x = face value, then find E( X). 

3.5 

Given the discrete probability distribution p(x) = (x2+ 1 )/5 for x = -1, 0, 1. 
Find E(X). 

0 

Given p(x) = I/n and x= 1, 2, 3,....,n. Find E(X ). 

(n+1)/2 

A and B enter into a bet according to which A will get Rs.1000 if it rains on 
that day and will lose Rs.500 if it does not rain. The probability of raining on 
a day is 0.7. What is the mathematical expectation of A? 


Rs. 550 
Given p(x) =c|x-2] andx = 1, 0, 1, 3. Find E(X), 
V/7 
Given x = 0, 2, 3 and p(x) = deal Find E(X). 
2 
syayr1\-* 
Given the discrete probability distribution p(x) = * X3) (3) x = 0,1,2,3. 
Find E( X?). 
3 
Given x = 1, 2, 3 and P(x) = 5/8, 2/8, 1/8. Find E(X+5). 
6.5 
Given x = 2, 4, 6 and p(x) = 2/6, 2/6, 2/6. Find E( 2X ). 
8 
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Q.58 
Ans, 
Q.59 
Ans. 
Q.60 
Ans. 
Q.61 


Ans. 
Q.62 
Ans. 
Q.63 


Ans. 
Q.64 


Ans. 
Q.65 
Ans. 
Q.66 
Ans. 


Q.67 


Ans. 


Q.68. 


ims, 
Q.69 
Ans, 
Q.70 
Ans. 


Q.71 


Q.72 
Ans. 
Q.73 
Ans. 
"Q.74 
Ans. 
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Given x = 1, 2, 3 and p(x) = 3/10, 6/10, 1/10. Find E( 2X*). 
7.2 
Given x = 2, 3, 4, 5 and p(x) = 3/10, 2/10, 3/10, 2/10. Find E [X-E(X) ]. 
0 
Given x = 1, 2, 3, 4 and p(x) = 1/8, 4/8, 1/8, 2/8. Find E(X?+ 2). 
9.25 
Find the mean and standard deviation for the following discrete distribution: 
f(x)=1 and x=5 
5;0 
Given E( X )=0 and E( X )?=8/9, find E( 3X?-2X+4 ), 
6.67 
What is the expectation of a person who is to get Rs.240 if he obtains 3 heads 
in a single toss of three coins. 
30 
If it rains a raincoat dealer can earn Rs.500 per day. If it is fair he can lose 
Rs. 100 per day. What is his standard deviation if the probability of rain is 
0.4? 
Rs, 272.76 
Given x = 0, 1, 2 and p(x) = 9/16, 6/16, 1/16. Find Var( X ). 
0.375 
Given E( X ) = 0.5, Var( X ) = 1.44 and Y= 2X + 2. Find E(Y) andS.D(Y). 
(3, 2.4) 
2 X-120 
Given E( X_) = 120 and Var ( X ) = 100. If Y = —i98 * then find mean and 
variance of Y. 
(0, 1) 
Two coins are tossed. If x denotes the number of heads. Find E( X ). 
1 . 
Given E(5X +10) = 18 and E( 5X + 10)? = 524. Find Var(X). 
8 
Given E( X - 2 )? = 25 and Var( X - 2) = 9. Find E(X) 
6 
Given Var(X) = 2 and Var(Y) = 4. If X and Y are independent random 
i 
variables then show that Var ( ie x) = 0.75. 
Given E(X) = 200 and S.D.(X) =5. Find E( X?). 
40025 
Given E(X) =5 and Var(X) = 1. Find E( 2 —- 3X) and Var( 2 - 3X). 
- 13 and 9 
Given E( 2X ) = 10 and E( 2X )? = 500, determine $.D(X). 
10 
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Q.75 Given a random variable X with E( X ) = 2.5 and Var( X ) = 1,25. Find E( X*), 

Ans. 7.5 

Q.76 Given E( X*) = 400 and 8.D.(X) = 12. Find E(X). 

Ans. 16 

Q.77 Given E(X) = 0.55, Var( X) = 1.35 and Y = 2X + 1, Find E(Y ) and Var( Y ), 

Ans. 2.1 and 5.4 

Q.78 Given E(X+ 4)=10 and E(X+4 )2= 116, determine Var ( X +4 ). 

Ans. 16 

Q.79 Given U(X+4)=10 and E( X+4 )?=116, determine E( X ) and Var ( X ). 

Ans. 6;16 

Q.80 Given E(X) = 200, C.V. (X) = 7%. Find Var( X)., 

Ans. 196 

Q.81 A continuous random variable X that can assume values between x = 0 and 
x = 2 has a density function given by f(x) = x/2, Show that the area under the 
curve is equal to 1. i 

Ans. Total Area =1 

Q.82 A continuous random variable X has a density function f(x) = c( 4 - x ) for 
x= 1 tox = 8, zero otherwise. Find c. 

Ans. 1/4 

Q.83 A continuous random variable X that can assume values between x= land 
x= 3 has a density function given by f(x) = 1/2. Show that the area under the 
curve is equal to one. 


Ans. Total Area = 1 


Q.84 Acontinuous random variable X has probability density function f(x) = cx, for > 
0<x< 1 and f(x) = 0, elsewhere. Find the value of c. 


Ans, 2 


Q.85 A continuous random variable X has probability density function f(x) = 0.1, 
for x = 0 to 10 and f(x) = 0, elsewhere. Find P(X=5). 


Ans. 0 


Q.86 A continuous random variable X has a density function f(x) = er forl<x<4. 
Find the value of c. 
Ans. 8/15 
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EXERCISES 


Q.1 Three balls are drawn from a bag containing 5 white and 3 black balls. {f 'X' 
denotes the number of white balls drawn from the bag, then find the 
probability distribution of X. z 

Ans. x: 0 1 2 3 
p(x): V/56 15/56 30/56 10/56 

Q.2 There are seven candidates for three positions of typist. Four of the candidates 
know urdu typing, while the other three do not know it. If the three candidates 
are selected at random, find the probability distribution of the number of 
persons knowing urdu typing among those selected. :. 


Ans. x: 0 1 2 3 
p(x): 1/35 12/35 18/35 4/35 

Q.3 A coin is tossed 4 times. If X denotes the number of tails, what is the 
probability distribution of X? Draw a probability histogram. 

Ans. x: 0 at 2 3 4 
p(x): 1/16 4/16 6/16 4/16 1/16 


Q.4 A bag contains 4 red and 6 black balls. A sample of 4 balls is selected from the 
bag without replacement. Let X be the number of red balls. Find the 
probability distribution of X. 


Ans. x: i) x 2 3 4 
p(x): 15/210 80/210 90/210 24/210 1/210 
Q.5 A random variable X has the probability distribution: 


(i) Determine the value of a. (ii) Find P(X<2 ), P(X 26), P(3<X<5). 
Ans. (i) 1/81 (ii) 4/81, 5/9, 1/9 
G.6 A random variable X has the following probability distril ition: 


Find the probability distribution of Y where Y = X?+1. 

Ans. y: 1 2 5 10 
pty): O21 0.2 0.3 0.4 

Q.7 Find the probability distribution of the number of boys in families with three 
children, assuming equal probabilities for boys and girls. 

Ans. x: 0 1 2 3 


p(x): 1/8 3/8 3/8 Vs 
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Q.8 From a lot containing 12 items, 4 of which are defective, 5 are chosen at 
random. If X be the number of defectives found in the sample, write down 


(i) the probability distribution of X (ii) P(X < 1) 


4 
Gii) P(1<X<3) (iv) Verify 2 pl x)=1. 
x= 
Ans. (i) x: 0 a 2 3 4 
p(x): 7199 35/99 42/99 14/99 1/99 (ii) 14/33 Gii) 14/33 


ea, 
Q.9 A continuous random variable X has a density function f(x) = a for x= 2 
tox = 4. Find (i) P(X < 3.5) (ii) P( 2.4< X $3.5) G@ii) P(X = 1.5). 
Ans. (i) 0.703 (ii) 0.543 (iii) 0 
Q.10A continuous random variable X that can assume values between X = 2 and 
2 
X =5 has a density function given by f(x) = 27 (x+1). Find 


(i) P(X <4) (ii) P08 < X <4). 

Ans. (i) 16/27 (ii) 1/3 

Q.11A continuous random variable X which can assume values between X= 2 and 8 
inclusive has a density function given by a (x + 8) where ‘a’ is a constant. Find 
@a Gi) P(3< X <6) @ii) P(X<6) (iv) P(X 24). 

Ans. (i) 1/48 Gi) 15/32 (iii) 7/12 (iv) 3/4 

Q.12A continuous random variable X having values only between 0 and 4 has a 
density function given by. 


f(x) =3 ~—ax, where 'a' is a constant Find: (i) a Gi) P(.1<X <2) 
Ans. (i) 1/8 (ii) 5/16 
Q.13 A continuous random variable X has probability density function 
f(x) =cx for 0<X<2. Determine (i) ¢ (ii) probability that 1< X< 1.5 
(ii) probability that X < 1.5. 
Ans. (i) 1/2 ii) 0.8125 iii) 0.5625 
Q.14 Ifa function f(x) = 4 (3+ 2x), 2<X< 4. Show that it is a density function and 
find the probability that 2< X s 3. 


Ans. 4/9 
Q.15 Given the following discrete probability distribution: 


Compute its mean, variance, standard deviation and coefficient of variation. 
Ans, 1.944, 2.054, 1.433, 73.71 % 
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Q.16 Let X be a random variable with probability distribution as follows: 
Ce ee eee 

Show that E( 5X + 8) = 5E(X) +8. 

Ans, 21 

Q.17 Given a random variable X with E(X) = 5 and another random variable Y with 
E(Y) = 10. Find E( X + 2Y) and E( 12 - 2X). 

Ans. 25, 2 

Q.18 A random variable X has the following probability distribution: 


Probability Distribution of X, 
1 
2 
3 


P(X=1) = 6/9 
P(X=2) = 2/9 
P(X=3) = 1/9 


(i) Determine the expected value of X (ii) Determine the variance of X 
(iii) Determine the standard deviation of X. 

Ans. (i) 1.444 Gi) 0.471 (iii) 0.686 

Q.19 Let X be a random variable with probability distribution: 


[ae a 0 1 2 3 
| p@ 0.125 | 0500 | 0200 | 0.050 | 0.125 


Find the probability distribution of the random variable Y = 2X + 1. Using the 
probability distribution of Y, determine E( Y ) and Var(Y ). 


Ans. y: -1 1 3 5 7 E(Y)=2.1 
p(y): 0.125 0.500 0.200 0.050 0.125 Var( Y ) = 5.39 


Q.20If f(x) = sol texl for x = 2, 3, 4, ..., 12, then find the mean and variance 
of the random variable X. 

Ans, 7, 5.83 

Q.21Find (i) E(X) Gi) E(X?) @ii) EEX - E(X)]° for the following probability 
distribution. 


Ans. (i) 7 (i) 590 @ii) 541 
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Q.22 A random variable X takes the values — 3, — 2, 2, 3. and 4 with probabilities p(x) 
equal to 1/5, 1/10, 1/10, 1/5 and 2/5 respectively. Compute the variance of 
(5X +10). 

Ans, 206 ( 

Q.23 Let X have the following probability distribution: 


Find E( 3X —1), E(X*) and E(X? +2). 
Ans. 7.1, 8.9, 10.9 
Q.241f E(X)=5, E(X?) = 36, find the mean and variance of 2X — 6. 
Ans. 5, 44 


Q.25 What is the expectation of a person who is to get Rs.80 if he obtains 3 heads in 
a single toss of 3 coins. 


Ans. Rs.10 


Q.26A committee of size 5 is to be selected at random from 3 women and 5 men. 
Find the expected number of women on the committee. 

Ans. 2 approximately 

Q.27A and B throw a die for a prize of Rs.55, which is to be won by the player who 
first throws 6. If A has the first throw, what are their respective expectations? 

Ans. Rs.30, Rs.25 

Q.28 Three balls are drawn from a bag containing 5 white and 3 black balls. If X 
denotes the number of white balls drawn from the bag, then find the 
probability distribution of X. Also find its mean and variance. 

Ans. x: 0 1 2 3 *E(X) = 1.875 
p(x): 1/56 15/56 30/56 10/56 Var( X) = 0.502 

Q.29 If a pair of fair dice is rolled and X denotes the sum of the numbers on them, 
then find the expectation of X. If only a single die is rolled what is the 
expectation of a number on the die. 

Ans. 7, 3.5 

Q.30 A man draws 2 balls from a bag containing 8 white and 5 black balls. If he is to 
receive Rs,70 for every white ball which he draws and Rs.7 for every black ball, 
find his expectation. 


Ans. Rs.61.25 
Q.31A person plays a game of tossing a die under the condition that he would get 


twice as many rupees as the number of points on the upper face. Find his 
expectation. 


Ans. 7 
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Q.32 In a summar season, a dealer of desert room coolers can earn Rs.800 per day if 
the day is hot and can earn Rs.200 per day if it is fair and loses Rs.50 per day if 
it is cloudy. Find his mathematical expectation if the probability of the day 
being hot is 0.40 and for being cloudy it is 0.35. 

Ans. Rs.352.5 


Q.33 A coin is biased such that a head is thrice as liJely to occur as a tail. Find the 
probability distribution of heads and also find the mean and variance of 
distribution when it is tossed 4 times. 


’ Ans, x: 0 1 2 3 4 E(X)=3 
p(x): 1/256 12/256 54/256 108/256 81/256 Var(X)=0.75 


Q.34 Approximately 10 % of the glass bottles coming from a production line have 
serious defects. If two bottles are selected at random, find the expected number 
of bottles that have serious defects. 


Ans. E(X) = 0.2 


Q.35 If it rains a raincoat dealer can earn Rs.500 per day. If it is fair he can lose 
Rs.100 per day. What is his expectation if the probability of rain is 0.4? 


Ans. Rs.140 


Q.36 A and B enter into a bet according to which A will get Rs.200 if it rains on that 
day and will lose Rs.100 if it does not rain. The probability of raining on a day 
is 0.7. What is the mathematical expectation of A? 


Ans. Rs.110 


Chapter 
9 


BINOMIAL AND HYPERGEOMETRIC 
DISTRIBUTIONS 


9.1 INTRODUCTION 

In random experiments, we can divide a sample space into two parts. Fruit in a 
basket can be divided into two parts sweet and sour, in a throw of die the six faces 
can be divided into even and odd faces or prime and non-prime faces, When two dice 
are thrown, the total on the dice may be divided into even or odd. These experiments 
have only two possible outcomes. A very simple example of this type of experiment is 
throw of a coin which shows a head or a tail. The probability of each outcome is not 
disturbed by repeating such experiment. In tossing a coin large number of times, the 
probability of head remains constant and every time the result of coin is 
independent of previous throws. We apply the binomial distribution in all these 
situations in which sample space is divided into two parts according to some 
definition of partition. Independent trials with only two possible outcomes and 
constant probabilities are called binomial or, sometimes, Bernoulli Trials, after J. 
Bernoulli (1654 — 1705), who was a pioncer in the field of probability. 
9.2 BERNOULLI TRIALS 

A student selected from a class may be intelligent or non-intelligent, a bulb 
selected from a big lot may be good or defective, a person may be smoker or non- 
smoker. These trials have only two possible outcomes, One of these outcomes is 
called success and the other is called failure. The word success is not always used for 
outcomes like "intelligent", "good" or "useful". This word is used for the outcome in 
which we are interested, It may be used for a non-intelligent student or for a 
defective bulb. The probability of success is usually denoted by p and the probability 
of failure is denoted by q (q = 1—p). The term Bernoulli trials is used for these trials 
provided the probability of success p does not change from trial to trial. 
9.3. BINOMIAL EXPERIMENT 

An experiment with n independent trials in which the outcomes can always be 
classified as either a success or a failure and the probability of success remains 
constant from trial to trial is called a binomial experiment. A binomial experiment 
has the following properties, 
(i) Each trial results in two outcomes which can be classified into success and 

failure. 
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(ii) The probability of success remains constant from trial to trial. 

(iii) The successive trials are independent. 

(iv) The experiment is repeated a fixed number of times say, n. 

9.4 BINOMIAL RANDOM VARIABLE 


Let X be the total number of successes in n independent Bernoulli trials, with p 
as the probability of success in a single trial, then X is called the binomial random 
variable with parameters n and p. 

9.5 THE BINOMIAL DISTRIBUTION 


The binomial distribution was discovered early in the eighteenth century by 
the Swiss mathematician, J. Bernoulli (1654 — 1705). It was published after his 
death in 1713. If p is the probability that an event will happen in any single trial 
and q = 1-—p is the probability that it will fail to happen in any single trial, then the 
probability that the event will happen exactly X times in n trials (i.e. x successes and 
n-—x failures will occur) is given by 

n n! 

P(K=x)= f(x) =boe np) = (8) pares = goby pre? 

where x=0, 1, 2, 3,...,n and n!=n(n—1) (n—-2)... 3.2.1. O!=1 

The probability distribution of the number of successes so obtained is called the 
binomial probability distribution for the obvious reason that the probabilities of 0, 1, 
2, 3, «+, X, «ss, m successes are the respective terms in the binomial expansion 
(q+ p)®. The binomial distribution contains two independent constants, n and p. If n 
and p are known then we can determine all measures like mean, variance, 
coefficient of skewness etc. of the distribution. They are called parameters of the 
binomial distribution. If p = q = 1/2, the binomial distribution is a symmetrical 
distribution and when p # q, it is a skewed distribution. 
9.6 GRAPH OF THE BINOMIAL DISTRIBUTION 

The probability function will be symmetrical when p = 1/2 ( Figure 9.1.). If 
p > 1/2 ( Figure 9.2 ), the probability function will be negatively skewed. If p< 1/2 
( Figure 9.3 ), it will be positively skewed. The greater the difference between p and 
1-—p, the greater the skewness of the probability function. However, as n increases, 
the probability function approaches symmetry regardless of the difference between p 
and 1—p. The degree of skewness can be measured with the help of a formula called 

| G—p 
coefficient of skewness = c 

Vnpa 
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Figure 9,1. Binomial distribution for n=4, p = 1/2. 


b(x; 4, 1/2) 
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0.4 a, 0.3750 


03 4 
0.2500 0.2500 


v4 


O1 + 0.0625 0.0625 


0 1 2 2 4 
Figure 9.2. Binomial distribution for n= 4, p = 7/10. 


b(x; 4, 7/10) 
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Figure 9.3. Binomial distribution for n=4, p = 3/10. 
b(x; 4, 3/10) 


Example 9.1. 
An event has the probability p = 2/5. Find the complete binomial distribution 
for n= 5. 


Solution: 
The probability of x successes in a series of n trials is given by 
P[X=zx] “= bn,) = (2) pt qn-* 0) 1 258) 0 T- 


Here, n=5,p = 2/5 and q= l-p= 3/5 
Let the random variable X denote the number of successes, then the possible 
values of X are 0, 1, 2, 3, 4 and 5. Therefore 


P[X=x] = b(@5,25) = (2 (2y@ Pee = On DeB aS: 
p[x=o] = besa) = (S)(2)(@) = oe 
p[x=1] = va;5,28) = (3)(3 (2) - 2 
pix=2) = v5.28) = (S)(2f(2) = dios 
p[x=3] = b(3;5,2) = (3)(2) ey = oe 
p[x=4] = b(4;5,28) = (°)(? *(3) ato 
p[x=5] = b(:5,25) = (2)(2) ay = sa 
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Thus the complete binomial distribution with p = 2/5 and n = 6 in tabular 
form is ea as follows: 


CE OS I 


243 | 810 | 1080 | 720 | 240 | _32 ie 
Pal 3125 3125 | 3125 gi25 | PPIX=x 


Example 9.2. 

The experience of a house-agent indicates that he can provide suitable 
accommodation for 75 percent of the clients who come to him. If on a particular 
occasion, 6 clients approach him independently, calculate the probability that: 

(i) less than 4 clients will get satisfactory accommodation 

(ii) exactly 4 clients will get satisfactory accommodation 

(iii) at least 4 clients will get satisfactory accommodation 

‘Gy) at most 4 clients will get satisfactory accommodation. 
Solution: S 

The probability of x successes in a series of n trials is given by 


P[X=x] = (s} p*qn-* x=10, 1, 2, 3, .«, 0. 

Here, n = 6,p = 75/100 = 3/4, and q=1-p= 1/4 

Let the random variable X denote the number of clients who will get 
satisfactory accommodation. Then the possible values of X are 0, 1, 2, 3, 4, 5, 6. 
Therefore 


Bes 6 Se aes x=0,1,2,3,4,5,6. 


oma AOE 
+(5)(2) a} 
i on +08 Ht ae ea 3 ies = 0.1694 
(i) PIX=4] = (2) = LBS = o2066 
REEO ORO ROUIO RO CIO} 


seth 1458 729 3402 


P(Beoney ater) 


(Gv) P[X<4] 


tl 
yas 
on 
a 
eae 
Ee 
S 
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Example 9.3. 


Large lots of incoming products at a manufacturing plant are inspected for 
defectives by means of a sampling scheme. Only 8 items are to be examined and the 


(i) accepted? (ii) rejected? 
Solution: 
The probability of x successes in a series of n trials is given by 


P[X=x] = b(x; n, p) = (2) p®qn-x x=0, 1, 2, 3,..., n. 


Here, n= 8p = 10/100= 1/10 and q=1-p = 9/10 
Let the random variable X denote the number of defective items, then the 
possible values of X are 0, 1, 2, 3, ..., 8. Therefore 


P[X=x] (2)(4) "(3)" x=0,1,2,3,...,8. 
0 


p 8)(1)\ 9) (8)( 1) 9) 

@ F[X<2] = eae: Nas) (2)(S)(8) 
= 0.4305 + 0.3826 = 0.8131 

1-P[X<2] = 1-0.8131 = 0.1869, 


Gi) P[X22] 
Example 9.4 


Assuming that each baby has probability 0.4 of being male, find the probability 
that a family of 4 children will have 


(i) exactly one boy (ii) exactly one girl (ii) at least one boy 
(iv) at least one girl (v) at most one boy (vi) at most one girl 
Solution: 


The probability of x successes in a series of n trials is given by 
P(X=x) = b(x;n,p) = a P¥q?-* x=0,1, 2,3, --,n. 


Here, n= 4, p=0.4, q=1-p=1-04=06 
Let the random variable X denote the number of boys, then the possible values 
of X are 0, 1, 2, 3, 4. Therefore 


P(X=x) = (4) (0.4)* (0.6)4-* x=0, 1, 2,3, 4, 


@ P(X=1) = (4) (0.4)! (0.6) = 0.3456 
(ii) P(X=8) = (4) (0.4)3 (0.6)! = 0.1536 
(ii) P(X21) = 1-P(X=0) = 1-(4) 0.99 @.6+=1- 0.1296 = 0.8704 
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(iv) P(X <3) 


i] 


1-P(X=4) = 1 aie} (0.4)4 (0.6) = 1 — 0.0256 = 0.9744 


(vy) P(X<1) 


iT} 


4 
Ea (0.4)? (0,6)4 + ( | (0.4)! (0.6)3 
0.1296 + 0.3456 = 0.4752 
ia (0.4)3 (0,6)! + eal (0.4)4 (0.6)° 
0.1536 + 0.0256 = 0.1792 


W 


(vi) P(X283) 


Example 9.5, 
Given n= 6, p= 1/3. Find 
(a) P[X=-1] (b) P[X=2.5] (c) P[X=2] (d@) P[X= 10]. 
Solution: 
The probability of x successes in a series of n trials is given by 


P[X=x] = es p* qr-* X= OFT ai 8) eee ns 
Here, n=6,p = 13, q=1=p'= 2 and x=0, 1, 2, 3, 4, 5, 6. 


P[X=x] = ray x=0, 1, 2,3,..,6. * 


a) PL X=s1 ] = 0, because a random variable X in a binomial distribution 
takes only positive integral values. 

ob) P[X=25 ] = 0, because X can take only integer values 0, 1, 2, 8, 4, 5, 6. 

E 6)(1)2(2)\4 240 

Wax Sz 1's (2)(3) (3) = 729 

‘d) P[X=10] = 0, because X can take only values 0, 1, 2, 3, 4, 5, 6. 


9.7 MEAN, VARIANCE AND STANDARD DEVIATION OF THE 

BINOMIAL DISTRIBUTION 

The mean, variance and standard deviation of the binomial distribution, means 
the mean, variance and standard deviation of the values taken by the variable in 
‘epeated binomial experiments. Instead of carrying out these experiments in order 
0 calculate the mean, variance and standard deviation, it can be shown 
nathematically that the following formulae may be applied to any binomial 
listribution. 

Mean = p = =Ixp(x) = np 


Variance = o? = Ex? p(x) — [Ex p(x)]? = npq 
SD Hey =x? p(x) — [Ex p(@x)]? = Vnpq 
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Example 9.6. 
Find the mean, variance and standard deviation of the binomial distribution 
(at+p) 
Solution: 
Here, n=2, x = 0,1, 2. Therefore 
P[X=x] = (2) pa x'=0, 1, 2: 
2 0 42 2 2 
P[X=0] = (o)p°a?= q@ P[X=1] = (;) pa = 2pq 
2 


P[X=2] = i) pq? = p* 


Thus, the probability distribution in tabular form for the computation of mean, 
variance and standard deviation of X is given as follows: 


E(X) = p = Exp(x) = 2pq + 2p? 
= 2p(qa+p) = 2p(1) = 2p 
E(X?)= =x’ p(x) = 2pq + 4p? 


Var(X)= o° = E(X*)-[E(X)]? = 2pq + 4p?- (2p) 
= 2pq+4p*—4p? = 2pq 
S.D(X)=.6 = V2pq 
Example 9.7. 


Show that the mean = 3p, variance = 3pq and standard deviation = \3pq for a 
binomial distribution in which n = 3. 


Solution: 
Here, n= 3, x = 0,1, 2,3. Therefore 
P[X=x] = (3) pa? x=0, 1, 2,3. 
3 3 
P[X=0] = (3) pa? = ¢ P[X=1] = (3) pq = 3pq° 
ae 3 
P[X=2] = (3) pa = Bp’q P[X=3] = (3) p'q’= p* 
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Thus, the probability distribution in tabular form for the computation of mean. 
variance and standard deviation of X is given as follows: 


E(X) = 


E(X*) = 


Var( X )= 


S.D( X ) 
Example 9.8. 


Prien 
q 0 


| | ep | 
0 


h = Ex p(x) = 8pq? + 6p*q + 3p? = 8p(q? + 2pq + p?) 
3p(qq+p)’ = 3p(1)? = 3p 

Ex’ p(x) = 3pq’ + 12p*q + 9p* = 3p(q’ + 4pq + Sp*) 
3p[ q? + 2pq + p? + 2pq + 2p*] = 3p[ (q+ p)* + 2p(q + p) ] 
3p (1)? + 2p(1) ] = 8p[ 1+ 2p ] = 3p + 6p? 
E(X?)-[E(X) |= 3p + 6p?— (3p)? = Sp + 6p? — 9p’ 
3p — 8p” = 3p(1 - p) = 8pq 


o = \3pq 


Find the mean, variance and standard deviation of the binomial distribution 


(q+p)*. 
Solution: 
Here,| .n 


P[X=x] 


P[X=0] 
P[X=3] 


x 
4 
= (43 pig Sx 0 eee, 
4 


PL X=0)="(4pq") 1 PL X= 2) = Gp? 
Fania. | PLEX = got 


i 
2 


variance and standard deviation of X is iven as follows: 


Thus, the probability distribution i 1 tabular form for the computation of mean, 


Basic Statistics Part 

E(X) = p = xx P(X) = 4pq? + 12p%q? + 12p*q + 4p* 

= 4p(q° + 3pq? + 3p%q + p%) = 4p(q + p)? = 4p(1)° = 4p 
E( X*) = xx?pm@) = 4pq? + 24p2q? + 36p°q + 16p4 

= 4p[q? + 6pq? + 9p%q + 4p9] 

= 4p[ q+ 3pq? + 3p%q + p+ 3pq? + 6pq + 3p*] 

= 4p[ (q+ p)? + 3p(q? + 2pq + p4)] : 

= 4pl (a + p)* + 3p(q + p)*] = pf (1)? + 3p(1)*] 

= 4p[1+ 3p] = 4p +12p? 
Var(X )= 07 = E(X)'-[E(X)}P = 4p + 12p? — (4p)? 

= 4p + 12p*— 16p? = 4p — 4p? = 4p(1 —p) = 4pq 
S.D(X) = ¢ = vapq 
Example 9.9, 


Find the mean, variance and standard deviation of the binomial distribution 
+p)". 
Solution: 


The number of successes x will be 0, 1, 2, 3, 4, ..., n. We know that 
n n “ n . n ys n = 
(tp = (2) prat+(*) pq” 14(2) pq” 2+(3) pq” "4(2) p‘qr* 
n 0 
tat ee p"q 


Therefore the probabilities of 0, 1, 2, 3,4, ..,n Successes are respectively, 
nm -1(n n-2(n n-3 (0 a a 
qa’, Gy Pq" 2) p*q aft) p*q <1} p‘q”, ..., p 


Thus, the probability distribution in tabular form for the computation of mean, 
variance and standard deviation of X is given as follows: 


ey 


( 


i) 
ss 
Nae? NES 


Ap OF VB eS 


3 
Tae 


B(X)= w= Exp@= (7) pat!+2(2) ptart+a(2) prgr-? 


+ (7) p*q’-*+... + np” 


= np[a"-1+ (n—1) pat? + 2 


3! a 
=npfa’!+("7 *) pat-2+ (751) agra + (957 )ptat t+. tet] 
= np [q@+p)"""] = np[(1)"]= np 


E(X?) = Ix’ p(x) = (2) pat! 4( 2) piat-?+ (2) p*q’=* 


+ 16 2) pig’"4+... + n?p" 


ee n(n—1) = n(@=1)i(m=2) , ,.. 
= npq’ Lp 31 p’q" Lies 9 31 p’q' 3 


ane n= in=2) (a=) piqt=4 +... + n2p" 


S 5 @-1)(n-2) , 4 
oxfa? 14 2(n= 1) pa’? +38 5 —— prgh? 


PH MOBO D ret 4 ag 


es nal n—1)(n—2) ., ,_ 
np|{a" "+ (a-1) pq’? + 3 pq’? 


+ Ga DO—D OAD rayety sae pr} 


f fM=)m-2) , 
+ {0-0 pa 249M ae p’q'? 


@=1)@=2)-3) , n- 
+3% - a2 “ a 44.4 (n—1)p""! 


np [(q +p)! + (n= 1) p{(q?-? + (n— 2) pq? 


4 SHBDO=9) ana y on +p] 


2! 
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= np[(q+py-!+ (n-1) plat-*+("7?) pqs 


“Ca tet] 
= np [(q+p)"-'+(n—1) pq+py-"]= np [a)-' + (= 1) pay?-4] 
= np [1+ (n—1)p] = np[1+np—p] =np + n2p?—np? 

Var(X)= o? = E(X*)-[E(X)]? = np + n*p? — np? — (np)? 
= np + n’p?~np*—n4p?= np — np*= np(1— p) = npq 

S.D(X) = o = Vnpq 

Alternative method 


The binomial random variable X with parameters n and p has the probability 
distribution 


P[X=x]= b(n,p) = (3) Didtae bee tog: 


mnie spcie Biel S}prt= 
( )= Ex Pe) Eee 
5 Hs inblite igh epg tl n(n— 1)! 52E nx 
wept xia— x)! oy x ep ae)! (n-x)! PP 


n n-1)! Ly ae 2 (n-1 se 
= mp 2 eB gag 5 a3) etgex 


x=1 


=mo[ar-t+(")*)pat—#+("51) start. ep 
=np[(q+p)""'}=np[(1)"-"] = np 
E(X?)= E[X(X-1)]+E(X)=E[X(X-1)]+np 


aa ome a @rre 
ere E[X(X-1)]= xzo  DPG) = 2 xx -1) zea 


x 


n! au 
yey og x!(n — x)! P*q° 


Ee, n(n — 1) (n ~ 2)! 20x-2 n—x 
Seo P xe ao (eat PP 7 

n n—2)! Boa. 
Cee igaaracn Poe 


n (n—-2 
= n(n-I)p? 5 (223) ptgh 
x=2 


=2\ 4 a 
+("3 part tpt’) 
= n(n Dp? [a + p)"-?] = n(n — 1) p*[()"~4] = an — 1) p’ 
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So, E(X?) = n(n— 1) p’+np = n’p? — np? + np 
Var(X)= 07 = E(X*)-[E(X)]? = n’p* — np? + np ~ (np)? 
= n’p’—np*+np—n'p?= np—np? = np(1—p) = npq 
S.D(X)='o = Vnpq 
Example 9.10. 
If X is a binomial random variable with E(X) = 1.44 and §.D(X ) = 0,96. Find 
the parameters of the binomial distribution. Also find P[X = 2]. 
Solution; 
Here, E(X)=np= 1.44, S.D(X) = Vnpq = 0.96 
Var( X ) = npq =(0.96)?= 0.9216 
1.44q = 0.9216 (since np = 1.44) 
or q= oseig = 0.64 and P=1T=q = 0:36 
np = 1.44 or n(0.36) = 1.44 (since p = 0.36) 
1.44 
n= 036. > 4 


4 
P[X=2]= (4) 0.36)? 0.64? = 0.3185 
Hence, n = 4, P = 0.86 and P(X=2) = 0.3185. 
Example 9.11, 


Is it possible to have a binomial distribution with mean = 10 and standard 
deviation = 6? 


Solution: 
Here, Mean= p = np = 10 
SD(X)= o = Vnpq= 6 
Var(X) = o? = npq = (6)*= 36 


10q = 36 (since np=10) or q — = 3.6 


This is impossible because q is a probability which can never exceed one. Hence 
the given values of mean and standard deviation are wrong. 
Example 9.12, 

Given n= 65, P(X = 1) = 5/32 and RX=2) = 10/32, Find P(X =0) and 
P(X=3). Also find coefficient of skewness, f° 


Solution; 
Ben 4 4 5 
ETS Tee aba = cepa = 8200 ot (1) 
F Md 
P[X=2] = ee p’q? = 10p%q3 = 2 sites (2) 
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Dividing equation (2) by equation (1) 


lop?a* 10/32 ope od 2 hE 

5 pa! 5/32 q = or 2p = 2qor p=q= 9 
5\(1\)(1\) 1 

pix=01 = (5)(3)(3) = as 
5)\(1(1\ 10 

LBS) oF Gra) i) 32 

Coefficient of skewness = seed = 0, there is no skewness, thus the 


distribution is symmetrical. 
9.8 PROPERTIES OF THE BINOMIAL DISTRIBUTION 
Properties of the binomial distribution are listed in the following table. 


| Mean u=np 
Variance ‘ o” = npq 
= Deviation o=\npq 
Moment Coefficient of Skewness y= pi = aa | 
npq 


Moment Coefficient of Kurtosis p,=3 + a | 


npq 


Note: Skewness is negative, zero or positive according as p> I/2or p = 1/2 or 
p< 2. 


9.9 BINOMIAL FREQUENCY DISTRIBUTION 
Let the n independent trials constitute one experiment and let this experiment 


. n ee us . 
be repeated N times. Then we expect x successes to occur n( ) p*q" ~* times. This 


will be called the expected frequency of x successes in N experiments and the 
possible number of successes together with the expected frequencies will be said to 
constitute the binomial (expected) frequency distribution. In practice, the observed 
frequencies will differ from the expected frequencies due to chance causes. For N 
sets, each of n trials the expected frequencies of 0, 1, 2, ..., n successes are given by 
the successive terms in the binomial expansion of N(q+ p)", q+ p=1. 
9.10 FITTING OF THE BINOMIAL DISTRIBUTION 

Suppose there are some intelligent and some non-intelligent students in a big 
college but we do not know the percentage of intelligent students. A random sample 
of 6 students is selected and the number of intelligent students in the sample is 
counted. This number will take any velue between 0 to 6. We sepeat the sample a 
large number of times say 100 and each time note the number of intelligent 
students. The frequencies of 0, 1, 2, ..., 6 intelligent students as observed in this 
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experiment are, say, 14, 15, 25, 25, 10, 6, 5. The sample data can be written as 
below: 


No. of intelligent it 
students (X) 
[recnueey (Ya fe] 


The sample mean (X) = HX = 240 = 2.4. This observed data is also called 


experimental or actual data. From this data we want to estimate the Proportion of 
intelligent students in the college. According to the theory of estimation, we find the 
sample mean = 2.4 and substitute it equal to np when n = 6. Thus 6p = 2.4 and 
Pp = 0.4. This value of p is called estimate of proportion of intelligent students in the 
college. When p = 0.4 then q=1l-p=1-04=06, Using n= 6, p= 0.4 and q=0.6 


No. of intelligent Expected Frequencies 
students(X) | px) = (Fi (0.4)* (0.6)°-* N.p(x) 
: (0.4)" (0.6)° = 0.046656 4.67 
1 ( Hd (0.4) (0.6)° = 0.186624 18.66 
2 : (0.4)? (0.6)* = 0.311040 31.10 
3 - (0.4) (0.6) = 0.276480 | 27.65 
: (0.4)* (0.6)? = 0.138240 13.82 
‘ (0.4)° (0.6) = 0.036864 3.69 
‘ (0.4)° (0.6)° = 0.004096 
100 


This procedure of finding expected frequencies of the binomial distribution is 
called fitting of the binomial distribution. It is basically the method of estimating 
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probability of success p(parameter of population) from the sample data and then use 
it to find the expected frequencies (most likely) of the given experiment. 
Example 9.13. 

The probability of male birth is equal to the probability of female birth. Out of 
400 families with 4 children each, find the expected number of families with 0, 1, 2, 
3 and 4 males. 
Solution: 

"The binomial frequency distribution is 


NP[X=x]= Nb&xn,p) = n(®) pra'-* x=0,1,2,3,..,n. 
Here, N = 400, n=4, p = 1/2, q = 1-p =1/2 and x=0,1,2,3,4. 


400P[X=0] = ao0(4)(3) (3) = 25 Families 
400 P[X=1] = 4oo( 4)(3) zy = 100 Families 
400 P[X=2] = aoo(4)(4¥ (3) = 150 Families 
400P[X=3] = aoo(4)(4)(3) = 100 Families 
400P[X=4] = aoo(4)(2)'(3) = 25 Families 


Example 9.14. 
Six dice are thrown 729 times. How many times do you expect at least three 
dice to show a five or a six? 


Solution: 
The binomial frequency distribution is 
NP[X=x] = Nbn,p) = n(2) p¥q’-® x=0,1,2,3, 
Here, p= 13, g = 1-p = 2/3, n= 6 and N = 729. 


Let the random variable X denote the number of dice to show a five or a six, 
then the possible values of X are 0, 1, 2, 3, 4, 5, 6. Therefore 


x(2)\6- 
729P[X=x] = 729(°)(3) (3) * 5 =0, 1,2, -, 6 


The expected number of times a least three dice will show a 5 or a 6 is 
= 0 5)(5) (3) + a0 2)(5) (3) + SNC) ) 
= 729(§ 3) \3 +729\ 4)(3) (3 +729\ 5 (3 3 
6 1)\6( 20 
+729(6)(3) Gi 
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Example 9.15, 


Fit a binomial distribution to the following data, obtained by tossing a biased 
coin 5 times, 


No. of heads (X) 
Frequency (f ) 


Here, <f = 200, XfX = 898, on S65 


= LfX 398 
X = np= 7 es g00 = 199 
or 6p = 1.99 (since n = 5) 
199 
Ore pA = “5 = 0.398 and q = 1-p = 1~-0.398 = 0.602 


Hence the binomial distribution to be fitted to the data is 
200P[X=x] = 200( 5) (0.398)* (0.602)°-* 


The expected or theoretical frequencies of 0, 1, 2, 3, 4, 5 successes ar2 
calculated as below: 


No. of heads 


Probability Expected Frequencies 


(X) P[X=x] N.P[X=x] 
0 ‘ (0.398)° (0.602) = 0.079065 15.81 or 16 
a (a (0.398) (0.602)' = 0.261360 52.27 or 52 
2 ; (0.398)? (0.602)? = 0.345586 69,12 or 69 
3 (0.398)° (0.602)? = 0.298477 45.70 or 46 

( ca (0.398) (0.602) = 0.075526 15.10 or 15 
4 (0.398)° (0.602)° = 0.009987 2.00 or 2 
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Example 9.16. 

A certain event is believed to follow the binomial distribution. In 1024 samples 
of 5, the result was observed once 405 times and twice 270 times. Calculate the 
probabilities p and q. 

Solution: 
The binomial frequency distribution is 


NP[X=x] n(*) pigh-* X= 0) 1,2) 3... 0, 
Here, N = 1024, n = 5, x=1 and 2. 


1024.P[X=1] = 1024(5) pq’ = 405 

1024.P[K=2] = 1024 ie p’q? = 270 

5120 pq’ = 405 2. ee (1) 

1020 pat =) 2T0. >  o e (2) 
Dividing equation (2) by equation (1), we get 

10240 p*4° 270 oy 2 

5120pq¢ 405 % q 3 


or 6p = 2q or Gp= 2(1—-p) 2-—2p or 6p+2p = 2 


2 
or 8p/= 2 or pe = go 4 and q = 1-p = 3/4 


Hence, p = 1/4 and q 3/4 
9.11 HYPERGEOMETRIC DISTRIBUTION 

If the probability of a success is not constant, the hypergeometric distribution 
is particularly useful. Suppose we have N distinct objects divided into two classes, 
say a class of successes and a class of failures. Suppose there are k successes and 
N-k failures. We take at random a sample of size n and ask for the probability that 
exactly x of the objects in it are successes and n — x failures. The probability 
distribution of the hypergeometric random variable x is 

(ee of successes in the pore) ee of failures in the agg 
oe Taken x at a time Taken n—x at a time 
B(X=x) = (see ino 
Taken n at a time. 

where, x=0, 1, 2, 3, ..., n or k (whichever is less), x < k, n-x<sN-k 


or 
ca eae 
x n—-x : : 
POX =x). = hG; NB = "tee x=0,1,2,3, -.,n or k (whichever is less) 
(*) x<k,n-x<sN-k 


If P( X =x) is the probability distribution, it should satisfy 
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J 
wh Alle hg a 


a 
@ P(X=x) 2 0 (ii) Sum of probabilities = De ee) = (*) = 


n n 
Note: (1) Hypergeometric distribution has three Parameters i.e; N, n and k, 


number of successes in the population whichever ig smaller, 

(3) When N is large, hypergeometric distribution approaches the binomial 
distribution. 

(4) Like the binomial distribution, the hypergeometric distribution may also 
be symmetrical or skewed. Whenever Pp = 1/2, the hypergeometric 
distribution will be symmetrical regardless of how large or small the 
value of n; however, when P # 1/2, the distribution will be skewed. 

9.12 HYPERGEOMETRIC EXPERIMENT 
An experiment in which a random sample is selected without replacement 
a known finite Population and contains a relatively large proportion of the 
population, such that the probability of a success does not remain constant from trial 
to trial is called a hypergeometric experiment. 

A hypergeometric experiment has the following properties (qualities): 

@ Each trial results in two outcomes which can be classified into success 
and failure. 

(ii) The successive trials are dependent, 

(iii) The probability of cach outcome does not remain constant from trial to 
trial, 


9.13 PROPERTIES OF THE HYPERGEOMETRIC DISTRIBUTION 
(1) The mean and variance of the hypergeometric distribution are as follows: 


= BOK) BS gaa arm gt a HEUN Be) —n 


te N?(N-1) 
If we set x = p, then the mean of the hypergeometric distribution coincides 
with the mean of the binomial distribution and the variance of the 
hypergeometric distribution is (= ) times the variance of the binomial 


distribution, 
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(2) When N is large, hypergeometric distribution approaches the emoniel, 
distribution. 

Example 9.17. 


An urn contains nine balls, five of them red and four blue. Three balls are 
drawn without replacement. Find the probability distribution of X = number of red 
balls drawn. 


Solution: 
The hypergeometric distribution is 


iene 
<3 


Here N=9,n=3,k=56, x= number of red balls then the possible values of x are 


0, 1, 2, 3. Therefore 
5 9-5) 
3- 
nacvninans AE x=0, 1, 2,3. 


G8. 8a 
CRaaCliia 
(8228) (0) ao | py tere) XO), 0 


Oo : 


Thus the hypergeometric probability distribution of red balls in tabular form is 
given as follows: 


Pegeer oem 


Example 9.18 


There are seven people who work in an office. Of the seven, four would like to 
be transferred. If three people from this office are randomly selected for transfer, 
what is the probability that: 


G) All three will want to be transferred? 

(ii) Two of the three will want to be transferred? 
(iii) At least two will want to be transferred? 

(iv) At most one will want to be transferred? 


P(X=x) = h@N,n, k= x=0, 1, 2,3,--,nork (whichever is less) 
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Solution: 
The hypergeometric distribution is 


Uy) 


P(X =x) = veya x= 0, 1, 2, 3,...,n or k (whichever is less) 
Nabe 


Here N=7,n=3,k= 4, x = number of people who wanted the transfer, then 
the possible-values of x are 0, 1, 2 and 3. Therefore 


7 


P(X) = (2) x=0, 1, 2,3, 
a, 3 
S(4yr-4) (ays 

@ acta oo Oe = 4 = 011s 

N 47-4 4/3 
Gi) P(X=2)= tied . ut = 3p = 05143 

3 3 

Gi) P(X22)= P(X=2)+P(x=3) = 8 4 4 = 2 = 0.6086 


(oXs=3) CNG21) 


Gv)  P(Xs1)= P(X=0)+P(X=1) = Tee 
(3) 
Es 


Re 2} e180 
= 35 5 = 35 = 0.38714 


Example 9.19 

A box contains ten items, seven of which are good and three are defective. Two items 
are selected (without replacement); compute the probability distribution for the 
number of defectives in the sample of two, Compute the mean and variance of this 


probability distribution. Is this mean equal to a and variance ui Ee 7 2 5 


Solution: 
The hypergeometric distribution is 
(oes) 
x/\n~-x : : 
P(X=x) rg be x=0, 1, 2, 3, ..., n or k ( whichever is less ) 


n 
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Here N = 10, n = 2, k = 3, x = number of defective items, then the possible 
values of x are 0, 1 and 2. 


Therefore 


nen OLE). GUO). 2 
es Die 


Thus the probability distribution in tabular form for the computation of mean 
and variance of X is given as follows: 


P(K = x) = p@) 
TAB 
TAS 
V/15 


Zp(x)=1 =x p(x) = 9/15 


Mean = E(X) = Ix p(x) = 4 = 0.6 


=x? p(x) = 11/15 


u 79 \*_ 28 
Var (X) = 02 = Ex? p(x) —[ Ex ps) ? = 75 -(45) = 75 = 0:3738 


nk _ @(3) — nk(N-k)(N=n) _ 28)0@ _ 

No ao ee and N?(N-1) = 1009) = 0.3733 
k k(N— = 

Hence B(X) = SE and. Van) = eR 
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SHORT DEFINITIONS 

Bernoulli Trial 

A trial that gives only two possible outcomes is called a Bernoulli trial. 

Binomial Experiment 


An experiment with n independent trials in which the outcomes can always be 
classified as either a success or a failure and the probability of success remains 
constant from trial to trial is called a binomial experiment. 


Properties of a Binomial Experiment 
(i) The experiment consists of n identical trials. 
(ii) The trials are independent. 


(iii) Each trial can result in one of only two possible outcomes, called success and 
failure. 


(iv) The probability of success p is constant from trial to trial. 
(v) The random variable X represents the number of successes in n trials. 
Binomial Probability Distribution 


A distribution that gives the probability of x successes for a fixed number of 
independent trials, where each trial must haye two possible outcomes and the 
probability of a success is constant from trial to trial, is called binomial probability 
distribution. 


or 


A probability distribution showing the probability of x successes in n trials of a 
binomial experiment is called binomial probability distribution. 


Binomial Probability Function 


The function used to compute probabilities in a binomial experiment is called 
binomial probability function. 


Binomial Formula * 
ies P(X = x) = b(x; n, p) = ag peqne* x=0,1,2,3,-,n 


Hope 


n= Sample size x = Number of successes 
n-x= Number of failures p = Probability of a success 


q=1-p = Probability of a failure n!=n(m-1)(n-2) + 3.2.1 


0! = 1 ( by definition ) 
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Hypergeometric Experiment 
An experiment in which a random sample is selected without replacement from a 
known finite population and contains a relatively large proportion of the population, 
such that the probability of a success does not remain constant from trial to trial is 
called a hypergeometric experiment. 
or 

An experiment in which a random sample is selected without replacement from a 
finite population in such a way that each trial is a Bernoulli trial and probability of 
success does not remain constant on each trial, is called a hypergeometric 
experiment. 
Properties of a Hypergeometric Experiment 
(i) The experiment consists of n identical trials. 
(ii) The successive trials are dependent. 
(iii) Each trial can result in one of only two possible outcomes, called success and 

failure. 
(iv) The probability of each outcome does not remain constant from trial to trial. 
Hypergeometric Probability Distribution 
Suppose a population consists of N items which are classified as k successes and 
N-k failures. If we select a sample of n items from the population without 
replacement in such a way that x successes are selected from k successes and n — x 


failures are\selected from N — k failures. The probability distribution defined in this 
situation is A Hypetzesiie nic distribution which is given by 


k)\(N-k 
P(X =x) = h(x; N, n, k) = ane x =0, 1, 2, 3, ---, n or k (whichever is less) 


a x<k, n-x<sN-k, n<N. 


MULTIPLE ~ CHOICE QUESTIONS 


1. A Bernoulli trial has: 


(a) at least two outcomes (b) at most two outcomes 
(c) two outcomes (d) fewer than two outcomes 
2. The two mutually exclusive outcomes in a Bernoulli trial are usually called: 
(a) success and failure (b) variable and constant 
(c) mean and variance (d) with and without replacement 
8. Nature of the binomial random variable X is: 
(a) quantitative (b) qualitative 


(c) discrete (d) continuous 
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4. 


5. 


9. 


10. 


11, 


12. 


13. 


14, 


15. 


In a binomial probability distribution, the sum of probability of failure and 
probability of success is always: 


(a) zero (b) less than 0.5 
(ce) greater than 0.5 (d) one 

In a binomial experiment, the successive trials are: 

(a) dependent (b) independent 
(c) mutually exclusive (da) fixed 

The parameters of the binomial distribution are: 

(a) nandp (b) pandg 

(c) np and ng (d) np and npq 
The range of binomial distribution is: 

(a) Oton (b) Otow 

(c) -lto+#1 (ad) Otol 


The mean and standard deviation of the binomial probability distribution are} 
respectively: , 


(a) np and npq (b) np and vnpq 

(ce) np andnqg (d) nandp 

In a binomial experiment with three trials, the variable can take: 
(a) 2 values (b) 3 values 

(ce) 4 values (d) 5 values 


The shape of the binomial probability distribution depends upon the values of 
its: 


(a) mean (b) variance 

(c) parameters (d) quartiles 

In binomial distribution the number of trials are: 

(a) very large (b) very small 

(c) fixed (d) not fixed 

In a binomial probability distribution, relation between mean and variance is: 
(a) mean < variance (b) mean = variance 

(c) mean > variance (d) difficult to tell 


In binomial distribution when n = 1, then it becomes: 
(a) hypergeometric distribution (b) normal distribution 


(c) uniform distribution (d) Bernoulli distribution 

The mean of a binomial distribution depends on: 

(a) number of trials (b) probability of success 

(c) probability of failure (d) number of trials and probability of success 
The variance of a binomial distribution depends on: 

(a) number of trials (b) probability of success 


(c) probability of failure (da) allof the above . 


"CAB, 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24, 
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Which of the following is not property of a binomial experiment? 
(a) Probability of success remains constant 

(b) nis fixed 

(©) Successive trials are dependent 

(d) It has two parameters 

The binomial probability distribution is symmetrical when: 


(a) p=aq (bt) p<aq 

() p>a (@)  np>npq 

The binomial distribution is negatively skewed if: 

(a) p<1/2 (bo) p=1/2 

() p>we2 (@) p=1 

In a binomial probability distribution, the skewness is positive for: 
(a) p<1/2 (b) p=1/4 

(c) np=npq (@) np=ng 


Which one of the following statements is false? 

(a) Expected value of a constant is the constant. 

(b) Ina binomial distribution the standard deviation is always less than its 
variance. 


(c) Ina binomial probability distribution the mean is always greater than its 
variance. ; 


(d) In binomial experiment the probability of success remains constant from 
trial to trial. 


If a binomial probability distribution has parameters (n, p ) = (5, 0.6), the 
probability of X = 3.5 is: 


(a) 0 (b) 1 
(©) 0.6 (@) 0.4 


In a binomial experiment n = 4, P(X = 2) = 216/625 and P(X = 3) = 216/625. 
P(X=-2) is: 


(a) 216/625 (b) 1 

(c) 0 (d) difficult to tell 
Ifn=6 and p = 0.9, then the value of P( X = 7 ) is: 

(a) zero (b) less than zero 
(c) more than zero (d) one 


In a binomial probability distribution, coefficient of skewness = (q-p)Nnpq 
= 0, it means that the distribution is: 


(a) symmetrical (b) skewed to the left 
(c) skewed to the right (ad) highly skewed 
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25. 


. 26. 


27, 


28. 


29. 


30. 


31. 


32, 


33. 


84, 


For a binomial distribution with n = 10, p = 0.5, the probability of zero or more 
successes is: 


(a) 1 (b) 0.5 
(c) 0.25 (d) 0.75 

In a binomial distribution, the mean, median and mode coincide when: 
(a) p<2 (b) p> 12 

(c) p#l/2 (dd) p=1/2 


In which distribution, the probability success remains constant from trial to 
trial? 

(a) Hypergeometric distribution (b) Binomial distribution 

(c) Sampling distribution (d) Frequency distribution 

In a binomial experiment when n = 5, the maximum number of successes will 
be: 

(a) 0 (b) 2.5 
() 4 @) 5 
In a binomial experiment when n = 10, the minimum number of successes will 
be: 


(a) 0 (b) 5 
(c) 10 Koi 1. 
Ifn= 10 and p = 0.6, then P( X 20) is: 

(a) 0.5 (b) 0.6 
() 1.0 (d) 12 


A random variable X has a binomial distribution with n = 4, the standard 
deviation of X is: 


(a) 4pq (b) 2Vpq 
() 4p (d) 4(q+p) 


In a multiple choice test there are five possible answers to each of 20 questions. 
If a candidate guesses the correct answer each time, the mean number of 
correct answers is: 


(a) 4 (b) 5 

() 1/5 (d) 20 

If three coins are tossed, the probability of two heads is: 
(a) 1/8 (b) 3/8 

(c) 2/3 (da) 0 


A random variable X has binomial distribution with n = 8 and p = 1/2. The 
most probable value of X is: 


(a) 2 (b) 3 
(ce) 4 (dd) 5 
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35. 


36. 


37. 


38. 


39. 


40. 


41. 


42. 


43. 


44, 


45, 


The value of second moment about the mean in a binomial distribution is 36. 
The value of the standard deviation of a binomial distribution is: 

(a) 36 (b) 6 

(c) 1/36 qd) 1/6 

For a binomial probability dist: ‘bution, the expected frequency of x successes in 
N experiments is: 


(a) Gere (b) p*(1-p)!-* 


() N(2) peq-* (a) (2) pel — pyre 


In a binomial frequency distribution 100 (1/5 + 4/5)5 The parameters n and p 
are respectively: 


(a) (5,1/5) (b) (1/5, 4/5) 

(c) (100, 4/5) (d) (5, 4/5) 

For a binomial frequency distribution 100 (1/5 + 4/5), the mean is: 
(a) 1/5 () 4/5 

©) 5 @) 4 


For a binomial distribution (1/3 + 2/3)18, the standard deviation of the 
binomial distribution will be: 


(a) 2 (be) 4 

() 6 qd) 12 

The hypergeometric distribution has: 

(a) one parameter (b) two parameters 
(c) three parameters (d) four parameters 
The parameters of the hypergeometric distribution are: 
(a) N,n,p (b) N,n,np 

() N,n,k (d) nandp 

Nature of the hypergeometric random variable is: 

(a) continuous (b) discrete 

(c) qualitative (d) quantitative 

In hypergeometric distribution, the successive trials are 
(a) independent (b) dependent 

(c) very large (a) very small 


In a hypergeometic distribution, the probability of success: 

(a) remains constant from trial io trial 

(b) does aot remain constant from trial to trial 

(c) equal to probability of failure 

(a) less than probability of failure 

If in a hypergeometric distribution N = 10, k = 5 and n = 4; then the probability 
of failure is: 

(a) 2 (b) 0.5 

@ 1 (@) 0.25 


47. 


48, 


49. 


50. 


51. 


52. 


58. 


54. 


55, 


56. 
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The range of hypergeometric distribution is: 


(a) Oton (b) Otok 

(c) OtoN (d) 0 ton or k (whichever is less) 
The number of tridls in hypergeometric distribution is: 

(a) not fixed (b) fixed 

(c) large (d) small 

The probability of a success changes from trial to trial in: 

(a) binomial distribution (b) hypergeometric distribution 
(c) normal distribution (d) frequency distribution 

The mean of the hypergeometric distribution is: 

(a) a (b) Me 

q @ @ atk 


The standard deviation of the hypergeometric distribution is: 


nk(k~N)(N-n) nk( N~k)(N-n) 
(a) N2(N-1) (b) NN-1) 


nk( N~k)(n-N) nk(N-k)+(N-n) 
© NXN+1) (a) NXN-1) 
In hypergeometric probability distribution, the relation between mean and 
variance is: 
(a) mean > variance (b) mean < variance 
(c) mean = variance (d) mean = 2 variance 
Which of the following is the property of hypergeometric experiment? 
(a) p remains constant from trial to trial 
(b) Successive trials are independent 
(c) Sampling is performed without replacement 
(a) nis not fixed 
Hypergeometric distribution reduces to binomial distribution when: 
(a) N=n (b) now 
() Noo (d) N<n 
In a hypergeometric distribution N = 6, n= 4 and k= 8, then the mean is equal 
to: 


(a) 2 (b) 4 

() 6 (d) 24 

Given N= 11,n=5,k=7; P(X21) equals: 

(a) 1 (b) 1/66 

(c) 65/66 (d) none of the above 
Given N= 12,n=5,k=4; P(X <4) equals: 

(a) less than one (b) exactly one 


(c) more than one (d) between 0.5 and 1 
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SHORT QUESTIONS 


Qi 
Q2 
Q.3 


Q4 
Q.5 
Q.6 
Q.7 
Qs 
Qo 


Q.10 
Q.11 
Q.12 
Q.13 
Q.14 


Q.15 


Ans. 
Q.16 


Ans. 
Q.17 


Ans. 
Q.18 


Ans. 


Define a Bernoulli trial. 
Define binomial experiment. 
What properties must an experiment possess to be classed as a binomial 
experiment? or State the properties of a binomial experiment. 

What is a binomial experiment and what are its properties? 

Define binomial probability distribution. 

Define binomial probability function. 

Describe the conditions of a binomial experiment. 

State the formula used to calculate binomial probabilities? 

Which formulae should be used to find the mean, variance, standard 
deviation and coefficient of variation of a binomial random variable? 

Write down the properties of the binomial probability distribution. 

What is a binomial distribution and what are its properties? 

What are the parameters of a binomial distribution? 

Describe the binomial distribution. 

Describe the binomial frequency distribution. 

If X is a binomial random variable with n = 4 and P( X= 2) =3P(X=3). 
Find p 

V3 

A random variable X is believed to follow a binomial distribution with 
b(x, 5, p). If P(X = 0) = 243/ 1024, find P(X =3). 

0.0879 

In a binomial experiment n = 5, P( X = 1) = 5/32 and P( X = 2) = 10/82. Find 
P(X=5). 

1/32 

A random variable X has a binomial distribution with n = 5 and p = 0.2. Find 
P(X=2). 

0.2048 
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For a binomial distribution with n = 10 and p = 0.5. Find the probability of 5 
successes. 

0.2461 

If X is a binomial random variable with mean = 2.4 and variance = 0,96, Find 
P(X=0). 

0.0256 

In a binomial distmbution consisting of 5 independent trials, probability of 2 
and 3 successes are 0.2048 and 0.0512 respectively. Find the value of 
variance. 

0.8 

If X is a random variable with a binomial distribution and its mean equals 
3.2 and its variance equals 1.92. Find the probability that all successes are 
observed. 

0.0007 

In a binomial distribution consisting of 5 independent trials, probabilities of 1 
and 2 successes are 0.4096 and 0.2048 respectively. Find the expected value. 
1 

In a binomial distribution mean = 2.4 and standard deviation = 1.2. Find the 
value of n. 

6 

A random variable X has binomial distribution with E(X) = 2.4 and p = 0.3. 
Find the standard deviation of X. 

1,30 

If a binomial probability distribution has parameters (n, p) = ( 20, 0.6 ). Find 
the expected number of successes and standard deviation. 

(12; 2.19 ) 

If X is a binomial random variable with n= 10 and p = 0.4, Find $.D.(8 — 2X). 
3.1 

If X is a binomial random variable with mean = 1.44 and standard deviation 
= 0.96. Find P(X = 2). 

0.3185 

Given n.= 5, P(X = 0) = 1/32 and P( X = 3) = 10/ 82. Find coefficient of 
skewness, 

zero 

Discuss the statement that in a binomial distribution mean = 5 and standard 
deviation = 5. 

Impossible because q = 5. 

In a multiple choice test there are five possible answers to each of 20 
questions. If a candidate guesses the correct answer each time, find standard 
deviation of correct answers. 

1,79) 
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Q.382 It is estimated that 1 in 20 students are left-handed. What sample size 

: should be taken to ensure that the expected number of left-handed students 
in the sample is 3? Also find the value of parameter o. 

Ans. n=60ando=1.69 

Q.33 The mean and second moment about the mean in a binomial distribution are 
10 and 6 respectively. Find the value of n. 

Ans, 25 

Q.34 If the probability that a day is fine is 0.6. Find the expected number and 
standard deviation of the fine days in a week. 

Ans. (4.2, 1.30) 

Q.35 If X is a binomial random variable with n = 10 and p = 0.4. Find mean and 


2. 
variance of Y = a= 2x 
Ans. -1land0.384 ° fe 


Q.36 Discuss the statement that in a binomial distribution mean = 5 and 
variance = 5, 

Ans. Since q is a probability of failure which is equal to one. In this case p = 0 andn 
is undefined, therefore the given statement is incorrect. 

Q.37 5 % of items in a large batch are defective. If 50 items are selected at random, 
find the probability that at least one will be defective. 


Ans. 0.9231 

Q.38 If X is binomially distributed with n = 10 and p = 0.4.Find the variance of 
Y= (X- 10). 

Ans. 0.0667 


Q.39 A supermarket stocks eggs are in boxes of six, and 10 % of the eggs are 
cracked. Assuming that the cracked eggs are distributed at random, compute 
the probability that a customer will find that the first box he chooses contains 
no cracked egg. 


Ans. 0.5314 

Q.40 In a binomial distribution with n = 5, P(X = 0) = P( X = 1 ). Find the 
variance. 

Ans, 25/36 


Q.41 It is known that 40 % of the defective parts produced in a certain 
manufacturing process can be made satisfactory by rework. Find the 
probability that in a batch of six such defective parts none of them can be 
reworked, 

Ans. 0.0467 


Q.42 If Xis a binomial random variable with n = 25 and p=0.2, 
Find P(X <p- 20). 

Ans. 0.0038 

Q.43 Define hypergeometric experiment. 

Q.44 Write down the properties of hyper metric experiment. 
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Q.45 Define hypergeometric distribution. 


Q.46 State the formula of hypergeometric distribution. 


Q.47 Write down the equation of the hypergeometric distribution. Also write its 
mean and variance. 

Q.48 Write down parameters, mean and variance of the hypergeometric 
distribution, 

Q.49 Write down the properties of the hypergeometric distribution, 

Q.50 Differentiate between binomial and hypergeometric distributions. 

Q.51 Given N= 10,n=4 andk=3, Find P(X = 1), 


Ans. 0,5 

Q.52 Given N=10,n=2andk=3, Find P(X =0). 
Ans, 0.47 

Q.53 Given N= 10,n=4 andk=5. Find E(X). 
Ans. 2 

Q.54 Given N= 40, n=5 andk=8, Find Var(X ). 
Ans. 0.72 

Q.55 GivenN=10,n=4andk= 7. Find $.D.(X). 
Ans. 0.75 


Q.56 If X is a hypergeometric random variable with N = 40, n= 6 and k = 4. Find 
the values of mean and variance. 

Ans. 0.5 and 0.4 

Q.57 A random variable X has hypergeometric distribution with N = 11,n=65 and 
k=7. Find P(X <1). 

Ans. 7/462. 

Q.58 If 13 cards are chosen at random (without replacement) from an ordinary 
deck of 52 cards, Find the probability that 6 are picture cards, 

Ans. 0.0271 
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Q1 


Ans. 


Q.2 


Ans. 


Q.8 


Ans. 


Q.4 


Ans. 


Q.5 


Ans. 


Qé 


Ans. 


Q7 


Ans, 


Q.8 


Ans. 


EXERCISES 


An event has probability p = 3/5. Find complete binomial distribution for n = 5. 
32/3125, 240/3125, 720/3125, 1080/3125, 810/3125, 243/3125 


A and B play a game in which A's chances of winning are 2/3. A series of 5 
games is played. Find the probability that: 


()  Avwill win 3 games (ii) A will win at least 3 games 

(iii) A will win at most 3 games. 

@) 380/243 (ii) 192/243 (iii) 131/243 

A fair coin is tossed four times. What is the probability of getting: 

(i) exactly one head (ii) exactly two heads (iii) at least two heads 
(iv) at most three heads (v) between one and three heads inclusive. 

@ 14 (i) 38 (ii) 11/16 (iv) 15/16 (vy) 7/8 


What is the probability of obtaining (i) 2 sixes, (ii) 3 sixes, (iii) at least 4 sixes 
and (iv) at most 3 sixes when a perfect cubical die is thrown 5 times? 


(i) 1250/7776 Gi) 250/7776 ii) 26/7776 Gv) 7750/7776 


A machine produces parts which are graded A or B according to finish and the 
proportion of A to B is well established at 2:1. Calculate the probabilities of 
obtaining 0, 1, 2, 3, 4, 5 parts of finish A in any sample of 5. 


1/243, 10/243, 40/243, 80/243, 80/243, 32/243 


If 20 % of the bolts produced by a machine are defective, determine the 
probability that of 5 bolts chosen at random. 


(i) 2 bolts are defective (ii) at least 3 bolts are defective. 
G@) 640/3125 Gi) 181/3125 


Team A has probability 2 of winning when ever it plays. If A plays 4 games 
find the probability-that A wins: (i) exactly 2 games (ii) at least one game 

(iii) more than half of the games. 

G@) = 8/27 Gi) 80/81 (ii) 16/27 

A rat has five choices of alternate routes in order to reach the food box. If it is 


true that for each choice the odds are two to one in favour of the correct 
pathway, what is the probability that the rat will make: 


(i) _ all of its choices correctly 

(ii) none of its choices correctly 

(ii) at least one of its choices correctly 

(iv) at least two of its choices correctly. 

@ 32/243 Gi) 1/243 (iii) 242/243 (iv) 232/243 
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Q.9 The probability that a patient recovers from a disease is 0.8. Suppose 5 people 
are known to have contracted the disease. Assuming independence find the 


probability that: 

(i) exactly 4 recover (ii) at least one recover 

(iii) at least one but not more than three recover (iv) at most three recover. 
Ans. (i) 0.4096 Gi) 0.99968 iii) 0.2624 (iv) 0.26272 


ia OE. 
Q.10 The probability that a college student can pass a subject is ae Find the chance 


that out of 5 students: (i) at least 3 will fail (ii) at most 3 will pass. 

Ans. (i) 992/3125, Gi) 2072/3125. 

Q.11The experience of births at a maternity hospital over a long period of time 
indicates that the babies born are boys and girls in equal number. If in a 
particular week 8 pregnant women are admitted to the hospital, determine the 
probability that: (i) there will be an equal number of boys and girls 

(ii) there will be at least 7 girls. 

Ans. (i) 0.2734 (ii) 0.0352 

Q.12 The record of a heart specialist shows that 75 % of the patients he operates 
survive. If on a particular day he operates 6 patients, find the probability that: 


(i) fewer than 4 patients will survive (ii) exactly 4 patients will survive 
(iii) at least 5 patients will survive. 
Ans. (i) 0.1694 (ii) 0.2966 (ii) 0.5339 


Q.13 In a certain district, the need for money to buy drugs is given as the reason for 
75% of all thefts. What is the probability that exactly 2 of the next 4 theft cases 
reported in this district resulted from the need for money to buy drugs? 

Ans. 0.2109 

Q.14A six faced die is so biased that it is twice as likely to show an even number as 
an odd number. If it is thrown five times, find the probability that it shows: 

(i) no even number (ii) all even numbers. 

Ans. (i) 1/248 (ii) 32/243 

Q.15 What is the probability of getting a total of 9: 

(i) twice (ii) at most twice (ili) at least twice in 4 throws of a pair of dice? 

Ans; (i) 0.0585 (ii) 0.9950 (iii) 0.0636 

Q.16If a cadet succeeds 4 times out of 5 in shooting in bull's eye, what is the 
probability that he will miss not more than 3 times in 10 attempts. 

Ans. 0.8791 

Q.17 Find mean, variance and standard deviation of binomial (q + p)*. 

Ans. Mean = 4p, Variance = 4pq, S.D. = \4pq 

Q.18 In a binomial distribution mean = 2.4 and §.D. = 1.2, Find the value of p and n. 

Ans. p=0.4, n=6 
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Q.19A random variable X has a binomial distribution with E( X )=2.4 and p = 0.3. 
Find n and standard deviation of X. 

Ans.n=8, §.D( X ) =1.2961 

Q.20If X is binomially distribution with n = 10 and p = 0.4 then find mean and 


X-10 


variance of Y =“ 

Ans. E( Y ) =— 1, Var( Y ) = 0.0667 

Q.211s it possible to have a binomial distribution with (simply write "yes" or "no"): 
G) Mean=5andvariance=2.5 (ii) P(X=-2)=0,1467 


(iii) P(X 1) =0.35 (iv) | Three parameters viz. n, p and q 
(v) P(X=1)=0.18 (vi) P(X21) = 1.27 
(vii) P(X = 3.8) = 0.29 (vill) P( 1<sX<4)= 0.79 
Ans.(i) Yes (ii) No (iii) Yes (iv) No (v) Yes 
(vi) No (vii) No * (viii) Yes 


Q.22Is it possible to have a binomial distribution with mean 15 and standard 
deviation 3? 


Ans, Yes 
Q.23 Find the mean and variance of binomial distribution ( q + p )*. 
Ans. Mean = 3p, Variance = 3pq 


Q.24A binomial distribution has mean = 1.44 and variance = 0.9216. Find 
probability that X is greater than 2. 


Ans. 0.136235 


Q.25 If a coin is tossed 5 times and the probability that it shows 3 heads is twice the 
probability that it shows one tail, then find q and p. 


Ans.p = q = 1/2. 
Q.26 Find the mean and variance of the binomial probability distribution (q + p)". 
Ans. Mean = np, Variance = npq 


Q.27A random variable X has a binomial distribution with E( X ) = 2 and 
Var (X ) = 24/13. Find the values of n and p and P(X = 2). 


Ans.n = 26,p = + , P(X=2) = 0.28165, 

Q.28 If X is a binomial random variable with n = 10 and p = 0.4, then find its mean, 
variance, standard deviation, E( 2X + 3), Var( 2—3X) and S.D(3- 2X). 

Ans. E(X) = 4, Var(X) = 2.4, S.D(X) = 1.55, E(2X+3)=11, 
Vat( 2-38X) = 21.6, S.D( 3-2X) =3.1. 

Q.29 Given that X is a binomial random variable with parameters n = 1 and p; prove 
that E(X) =p and S.D(X)=bq . 
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Q.30A random variable X has binomial distribution with n= 5 and p=0.4, Find 
(i) the most probable value of X (ii) the expected number of successes, 

Ans:(i) 2 (ii) 2 

Q.31 For a binomial distribution with n = 4 and p = 1/4, Find the complete binomial 
distribution. Also find the mean and variance of the probability distribution 
you have obtained. Is this mean equal to np and variance npq? 

Ans:E(X) =1. Var(X) = 0.75 

Q.32 Out of 2000 families with 4 children each, how many would you expect to have: 
(i) at least one boy (ii) two boys (iii) one or two girls (iv) no oir! 
Assuming equal probabilities for boys and girls. 

Ans. (i) 1875 Gi) 750 (iii) 1250 (iv) 125 

Q.33 Five coins are tossed 96 times: 
(i) Construct the theoretical frequency table 0, 1, 2, 3, 4, 6 heads. 
(i) Find the expected number of times of getting at least 3 heads. 

Ans. (i) 3, 15, 30, 30, 15,3 (ii) 48 

Q.34 Five dice are thrown 243 times. How many times do you expect at least two 
dice to show a three or four. 

Ans. 131 

Q.35 Determine the following: 
@ If n=4, N=10, and k=5, then find P(X=3). 
Gi) Ifn=4,N=6, and k=3, thenfind P(X=1), 
(ii) If n=5, N=12, and k=8, then find P(X > 1). 
(iv) If n=38, k=3, and N= 10, then find P(1<X<3), 

Ans. (i) 0.2381 (ii) 0.2 (iii) 0.3636 (iv) 0.7 

Q.36 Five balls are drawn from a box containing 7 red and 4 blue balls. If X denotes 


the number of red balls drawn from the box, then obtain the probability 
distribution of X. Compute the mean and variance of this distribution. 


Ans. 
REY em ey ie 


Q.37A box contains ten items, seven of which are good and three are detective. A 
sample of four items is to be selected. Compute the probability distribution for 
the numbers of defectives, Also compute the mean and standard deviation of 
this probability distribution, 


Mean = 3.1818, 
Variance = 0.6942 


Ans: 
Ae 
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Q.38 A state lottery is conducted in which six winning numbers are selected from a 
total of 54 numbers. What is the probability that if six numbers are randomly 
selected, 


(i) four numbers will be winning numbers? 
(ii) three numbers will be winning numbers? 
(i) none of the numbers will be winning numbers? 
(iv) aa least one number will be winning numbers? 
Ans. (i) 0.0007 (ii) 0.0134 @ii) 0.4751 — (iv) 0.5249 
Q.39 What is the probability of drawing five cards from a deck and getting: 
(i) three spades (ii) at most two spades (iii) at least three spades? 
Ans. (i) 0.0815 (ii) 0.9072 (iti) 0.0928 


Q.40 From a lot of 10 missiles, 4 are selected at random and fired. If the lot contains 
3 defective missiles that will not fire, what is the probability that 


G) all 4 will fire? (ii) at least 3 will fire? 
4ii) at most 2 will not fire? (iv) at least 2 will not fire? 
Ans. (i) 0.1667 (i) 0.6367 (iii) 0.9667 (iv) 0.3333 
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STATISTICAL TABLES 


RANDOM NUMBERS 


10 09 73 25 33 
37 54 20 48 05 
08 42 26 89 53 
99 01 90 25 29 
12.80 79 99 70 


66 06 57 4717 
3106 01 08 05 
85 26 97 76 02 
63 67 33 21 35 
73 79 64 57 53 


98 52 01 77 67 
11 80 50 54 31 
83 45 29 96 34 
88 68 54 02 00 
99 59 46 73 48 


65 48 1176 74 
80 12 43 56 35 
74 35 09 98 17 
69 91 62 68 03 
09 89 32 05 05 


91 49 91 45 23 
80 33 69 45 98 
44 10 48 19 49 
12 55 07 37 42 
63 60 64 93 29 


611969 04 46 
16 47 44 62 66 
94 55 72 86.73 
42 48 1162 13 
23 52 37 83.17 


04 49 35 24 94 
00 54 99 76 54 
35 96 31 53 07 
59 80 80 83 91 
46 05 88 52 36 


32 17 90 06.97 
69 23 46 14 06 
19 56 54 14 30 
45 16 51 49 38 
94 86 43 19 94 


76 52 01 35 86 
64 89 47 42 96 
19 64 50 93 03 
09 37 67 07 15 
80 15 73 61 47 


34 97 27 68 50 
45 57 18 24 06 
02 05 16 56 92 
05 32 54 70 48 
03 52 96 4778 


14 90 56 86 07 
39 80 82 77 32 
06 28 89 80 83 
86 50 74 8401 
87 51 76 49 69 


17 46 85 09 50 
17 72 70 80 15 
77 40 27 72 14 
66 25 22 91 48 
14 22 56 85 14 


68 47 92 76 86 
26 94 03 68 58 
85 15 74 79 54 
11 10 00 20 40 
16 50 53 44 84 


26 45 74.77 74 
95 27 07 99 53 
67 89 75 43 87 
97 84 40 87 21 
73 20 8 98 37 


75 24 63 38 24 
6405 188159 
26 89 8093 54 
45 42 72 68 42 
01 39 00 22 86 


87 3792 52 41 
20 1174 5204 
01 75 8753 79 
19 47 60 72 46 
36 16 8108 51 


34 67 35 48 76 
24 80 52 40 37 
23 20 90 25 60 
38 31 13 1165 
64 03 23 66 53 


36 69 73 61 70 
35 80 34 26 14 
68 66 57 48 18 
90 55 36 75 48 
35 80 83 42 82 


22 1094 05 58 
50 72 56 82 48 
13 74 67 00 78 
36 76 66 79 51 
91 82 60 89 28 


58 04 77 69 74 
45 31 82 23 74 
43 23 60 02 10 
36 93 68 72 03 
46 42 75 67 88 


46 16 28 35 54 
70 29 78 41 35 
32 97 92 65 75 
12 86 07 46 97 
40 21 95 25 63 


51 92 43 37 29 
59 36 78 38 48 
54 62 24 44 31 
16 86 84 87 67 
68 93 59 14 16 


45 86 25 10 25 
96 11 96 38 96 
33 35 13 64 62 
83 60 94 97 00 
77 28 14 4077 


05 56 70 70 07 
15 95 66 00 00 
40 41 92 15 85 
43 66 79 45 43 
34 88 83 15 53 


80 95 90 9117 
20 63 61 04 02 
15 95 33 47 64 
88 67 67 43 97 
98 95 11 68 77 


65 81 33 98 85 
86 79 90 74 39 
73 05 38 52.47 
28 46 82 87 09 
60 93 52 03 44 


60 97 09 34 33 
29 40 52 4201 
18 47 53 06 10 
90 36 47 6493 
93 78 56 13 68 


73 03 95 71 86 
211157 3253 
45 52 16 42 37 
76 62 113990 
96 29 77 88 22 


94 75 08 99 23 
53 14 03 33 40 
57 60 04 08 81 
96 64 48 94 39 
43 65 17 70 82 


65 39 45 95 93 
82 396101 18 
9119 04 26 92 
03 07 11 2059 
26 25 22 96 63 


61 96 27 93 35 
54 69 28 23 91 
77 97 45 00 24 
13 02 12 48 92 
93 91 08 36.47 


86 74 317157 
18 74 39 24 23 
66 67 43 68 06 
59 04 79 00 33 
01 54 03 54 56 


89 29 27 49 45 
00 82 29 16 65 
85 08 03 36 06 
04 43 62 76 59 
12 17 1768 33 


11 19 92 91 70 
23 40 30 97 32 
18 62 38 85 79 
83 49 12 56 24 
35 27 38 84 35 


50 50 07 39 98 
5277 56 78 51 
68 71 1778 17 
29 60 91 10 62 
23 47 83 4113 


40 21 81 65 44 
14 38 55 37 63 
96 28 60 26 55 
94 40 05 64 18 
54 38 21 45 98 


37 08 92 00 48 
42 05 08 23 41 
22 22 20 64 13 
28 70 72 5815 
07 20 73 17 90 


42 58 26 05 27 
33 21 15 94 66 
92 92 74 5973 
25 70 14 66 70 
05 52 28 25 62 


65 33 71 24 72 
23 28 72 95 29 
90 10 38 93 33 
78 66 52 01 06 
70 61 74 29 41 


85 39 41 18 38 
97 1189 63 38 
8499 28 52 07 
20 82 66 95 41 
05 01 45 11 76 
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AREAS 
UNDER THE 
STANDARD 
NORMAL CURVE 
from 0 toz 0: ee 
(4-places of decimals) 


z 0 al 2 3 4 5 6 a. 8 9 
0.01 0.0000 10.0040 | 0.0080 | 0.0120]0.0160] 0.0199 /0.0239/ 0.0279 | 0.0319 | 0.0359 


0.1| 0.0398 | 0.0438) 0.0478 | 0.0517 | 0.0557 | 0.0596 | 0.0636 | 0.0675 | 0.0714 | 0.0754 
0.2| 0.0793 | 0.0832 | 0.0871 | 0.0910 | 0.0948 | 0.0987 | 0.1026 | 0.1064 0.1103} 0.1141 
0.3) 0.1179 | 0.1217} 0.1255 | 0.1293 | 0.1331 | 0.1368 | 0.1406 | 0.1443 0.1480) 0.1617 
0.4] 0.1554 |0.1591| 0.1628 | 0.1664 | 0.1700 | 0.1736 | 0.1772 | 0.1808 | 0.1844 0.1879 
0.5| 0.1915 | 0.1950) 0.1985 | 0.2019 | 0.2054 | 0.2088 | 0.2123 | 0.2157 | 0.2190 0.2224 
0.6| 0.2258 | 0.2291 | 0.2324 | 0.2357 | 0.2389| 0.2422 | 0.2454 | 0.2486 | 0.2518 | 0.2549 
0.7| 0.2580 | 0.2612| 0.2642 | 0.2673 | 0.2704 | 0.2734 | 0.2764 | 0.2794 | 0.2823 0.2852 
0.8| 0.2881 | 0.2910) 0.2939 | 0.2967 | 0.2996 | 0.3023 | 0.3051 | 0.3078 | 0.3106 0.3133 
0.9| 0.3159 | 0.3186 | 0.3212 | 0.3238 | 0.3264 | 0.3289 | 0.3315 | 0.3340 | 0.3365 0.3389 
1.0] 0.3413 | 0.3438 | 0.3461 | 0.3486 | 0.3508 | 0.3531 | 0.3554 | 0.3577 | 0.3599} 0.3621 
1.1| 0.3643 | 0.3665 | 0.3686 | 0.3708 | 0.3729 | 0.3749 | 0.3770 | 0.3790 | 0.3810 | 0.3830 
1.2| 0.3849 | 0.3869 | 0.3888 | 0.3907 | 0.3925 | 0.3944 | 0.3962 | 0.3980 | 0.3997 0.4015 
1.3] 0.4032| 0.4049 | 0.4066 | 0.4082 | 0.4099 | 0.4115) 0.4131 | 0.4147 | 0.4162 0.4177 
1.4|0.4192| 0.4207 | 0.4222 | 0.4236 | 0.4251 | 0.4265 | 0.4279 0.4292 | 0.4306 0.4319 
1.5| 0.4332 | 0.4345 | 0.4357 | 0.4370 | 0.4382 | 0.4394 | 0.4406 | 0.4418 | 0.4429 0.4441 
1.6|0.4452| 0.4463 | 0.4474 | 0.4484 | 0.4495 | 0.4505 | 0.4515 | 0.4525 | 0.4535 | 0.4545 
1.7| 0.4554 | 0.4564 | 0.4573| 0.4582 | 0.4591 | 0.4599 | 0.4608 | 0.4616 | 0.4625 | 0.4633 
1.8] 0.4641 | 0.4649 | 0.4656 | 0.4664 | 0.4671 | 0.4678 | 0.4686 | 0.4693 | 0.4699 | 0.4706 
1.9| 0.4713 | 0.4719 | 0.4726 | 0.4732 | 0.4738 | 0.4744 | 0.4750 | 0.4756 0.4761 | 0.4767 
2.0| 0.4772 | 0.4778] 0.4783 | 0.4788 | 0.4793 | 0.4798 | 0.4803 | 0.4808 | 0.4812 | 0.4817 
2.1) 0.4821 | 0.4826 | 0.4830 | 0.4834 | 0.4838 | 0.4842 | 0.4846 | 0.4850 | 0.4854) 0.4857 
2.2| 0.4861 | 0.4864 | 0.4868 | 0.4871 | 0.4875 | 0.4878 | 0.4881 | 0.4884 | 0.4887 | 0.4890 
2.3| 0.4893 | 0.4896 | 0.4898 | 0.4901 | 0.4904 | 0.4906 | 0.4909 | 0.4911| 0.4913 | 0.4916 
2.4| 0.4918 | 0.4920) 0.4922 | 0.4925 | 0.4927 | 0.4929 | 0.4931 | 0.4932 | 0.4934 0.4936 
0.4938 | 0.4940 | 0.4941 | 0.4943 | 0.4945 | 0.4946 | 0.4948 | 0.4949 | 0.4951 0.4952 
0.4953| 0.4955 | 0.4956 | 0.4957 | 0.4959 | 0.4960 | 0.4961 | 0.4962 | 0.4963 | 0.4964 
0.4965 | 0.4966 | 0.4967 | 0.4968 | 0.4969 | 0.4970 | 0.4971 | 0.4 72 | 0.4973 | 0.4974 
0.4974 | 0.4975 | 0.4976 | 0.4977 | 0.4977 | 0.4978 | 0.4979 | 0.4 79 | 0.4980 0.4981 
0.4981 | 0.4982 | 0.4982 | 0.4983 | 0.4984 | 0.4984 | 0.4985 | 0.4986 | 0.4986 | 0.4986 
0.4987 | 0.4987 | 0.4987 | 0.4988 | 0.4988 | 0.4989 | 0.4989 | 0.4989 | 0.4990 0.4990 
0.4990 | 0.4991 | 0.4991 | 0.4991 | 0.4992 | 0.4992 | 0.4992 | 0.4992) 0.4993 0.4993 
0.4993 | 0.4993 | 0.4994 | 0.4994 | 0.4994 | 0.4994 | 0.4994 | 0.4995 | 0.4995 | 0.4995 
0.4995 | 0.4995 | 0.4995 | 0.4996 | 0.4996 | 0.4996 | 0.4996 | 0.4996 | 0.4996 | 0.4997 
0.4997| 0.4997 | 0.4997 | 0.4997 | 0.4997 | 0.4997 | 0.4997 | 0.4997 | 0.4997 | 0.4998 
0.4998 | 0.4998 | 0.4998 | 0.4998 | 0.4998 | 0.4998 | 0.4998 0.4998 | 0.4998 | 0.4998 
0.4998 | 0.4998 | 0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 0.4999 
0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 
8| 0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 | 0.4999 
3.9| 0.5000 | 0.5000 | 0.5000 | 0.5000 | 0.5000 | 0.5000 | 0.5000 | 0.5000 | 0.5000 0.5000 
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AREA UNDER THE 
STANDARD NORMAL CURVE 
From 0 to z ( 5-places of decimals) 0 7 


5. 


-00000 | .00399 | .00798 | .01197 | .01595 | .o1994 | .02392 | .02790 -03188 | 03585 
.03983 | .04380 | 04776 | .05172 | 05567 -05962 | .06356 | .06749 | .07142 | .07535 
-07926 | .08317 | .08706 | .09096 | .09483 | .09871 | .10257 | .10642 -11026 | .11409 
“11791 | .12172 | .12552 | 12980 | .13307 | 13683 | .14058 | .14431 | .14803 15173 
15542 | .15910 | .16276 | 16640 | .17003 | .17364 | .17726 | .18082 | .18430 .18793 
-19146 | .19497 | .19847 | .20184 | .20540 | .20884 | 21226 | .21566 | 21904 22240 
-22575 | .22907 | .23237 | .23666 | .23891 | 24215 | 24537 | .24857 | 25175 25490 
-25804 | .26116 | .26424 | 26780 | .27035 | .27337 | 27637 | .27935 | .28230 28524 
-28814 | .29108 | .29388 | .29673 | .29955 | .30234 | 30511 | .30785 33646 | 33891 
-81594 | 31869 | .32121 | 32381 | .32639 | 32894 | .33147 -33398 | .33646 | 33891 
-34134 | .343765 | .34614 | 34850 | .35083 | .35314 | 35543 | .35769 -35993 | 36214 
-36433 | .36650 | .36864 | .37076 | .37286 | 37493 | 37698 -37900 | 38100 | .28298 
-38493 | .38686 | .38877 | 39065 | .39251 | .39435 | 39617 -39796 | 89973 | .40147 
-40220 | .40490 | .40658 | 40824 | .40988 | 41149 | .41309 | 41466 41621 | 41774 
41924 | 42073 | .42220 | 42364 | 42507 | .42647 | 42786 | 42922 -43056 | .43189 
-43319 | .43448 | .43574 | 43699 | .43822 | 43943 | 44062 44170 | 44295 | .44408 
44520 | .44630 | .44738 | 44845 | .44950 | .45053 | 45154 | .45254 45352 | 45449 
-45543 | 45637 | .45728 | 45818 | .45907 | .45994 | 46080 | .46164 46246 | 46327 
-46407 | .46485 | .46562 | 46638 | .46712 | .46784 -46856 | .46926 | .56495 | .47062 
-47128 | .47198 | .47257 | .47320 | .47381 | .47441 | .47500 | 47558 47616 | .47679 
-47725 | 47778 | .47831 | .47882 | .47932 | 47982 | .48030 | 48077 48124 | 48169 
5 -48300 | .48341 | .48382 | .48422 | 48461 | .48500 | .48537 | 48574 
-48610 | .48645 | .48679 | .48713 | .48745 | 48778 | 48809 | .48840 48870 | .48899 
48928 | .48956 | .48983 | .49010 | .49036 | .49061 | .49086 | .49111 -49134 | .49158 
49180 | .49202 | .49224 | .49245 | .49266 | .49286 | .49305 -49324 | .49243 | 49361 
-49379 | 49393 | .49413 | .49430 | .49446 | 49461 | .49477 | .49492 -49506 | .49520 
49534 | .49647 | .49560 | .49573 | .49586 | .49598 | .49609 | .49621 49632 | .49643 
-49653 | .49664 | .49674 | .49683 | .49693 | .49702 | .49711 | .49720 49728 | .49736 
49744 | 49752 | .49760 | .49767 | .49774 | .49781 | 49788 | .49795 -49801 | .49807 
-49813 | .49819 | .49826 | .49831 | .49836 | .49841 | .49846 | .49851 -49856 | .49861 
-49865 | 49869 | .49874 | .49878 | .49882 | .49886 | .49889 -49897 | .49897 | .49800 
49903 | .49906 | .49910 | .49913 | .49916 | .49918 | .49921 | .49994 -49926 | .49929 
-49931 | .49934 | .49936 | .49938 | .49940 | .49942 | 49944 49946 | .49948 | 49950 
49952 | .49953 | .49955 | .49957 | .49968 | 49960 | .49961 -49962 | .49964 | .49965 
-49966 | .49968 | .49969 | .49970 | .49971 | .49972 | .49973 49974 | .49975 | 49976 
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Ordinates (Y) 
of the 
Standard 
Normal Curve ys 
atz 


z/| 0 i tee sag Be fe 87) 5 6 7 8 9 

0.0/0.3989 | 0.3989 | 0.3989 | 0.3988 / 0.3986 /0.3984 | 0.3982 | 0.3980) 0.397710.3973 | 
0.1/0.3970 | 0.3965 0.3961 | 0.3956 0.3951 | 0.3945 | 0.3939 0.3932 0.3925 | 0.3918 
0.2) 0.3910 | 0.3902) 0.3894 | 0.3885 | 0.3876 | 0.3867 | 0.3857 | 0.3847 | 0.3836| 0.3825 
0.3) 0.3814) 0.3802 |0.3790|0.3778 0.3765 0.3752 0.3739 | 0.3725 | 0.3712 0.3697 
0.4) 0.3683 | 0.3668 | 0.3653 0.3637 | 0.3621 | 0.3605 0.3589 | 0.3572 | 0.3555 | 0.3538 
0.5) 0.3521 | 0.3503 | 0.3485 | 0.3467 0.3448) 0.3429 | 0.3410| 0.3391 |0.3372| 0.3352 
0.6) 0.3332 | 0.3312 | 0.3292 / 0.3271| 0.3251 | 0.3230 | 0.3209 | 0.3187|0.3166|0.3144 
0.7/ 0.3123 | 0.3101 | 0.3079 | 0.3056] 0.3034 | 0.3011 | 0.2989 | 0.2966 | 0.2943 | 0.2920 
0.8) 0.2897 | 0.2874 | 0.2850 | 0.2827 | 0.2803 | 0.2780 | 0.2756] 0.2732 |0.2709| 0.2685 
0.9) 0.2661 | 0.2637 | 0.2613 | 0.2589 0.2565) 0.2541 | 0.2516 0.2492 |0.2468| 0.2444 
1.0) 0.2420 | 0.2396 | 0.2371 | 0.2347 | 0.2323 | 0.2299 | 0.2275 | 0.2251 | 0.2227| 0.2203 
1.1) 0.2179 |0.2156 | 0.2131 | 0.2107 | 0.2083 | 0.2059 0.2036 0.2012 /0.1989| 0.1965 
1.2/0.1942 | 0.1919) 0.1895) 0.1872 | 0.1849 0.1826 | 0.1804 | 0.1781 | 0.1758) 0.1736 
1.3) 0.1714] 0.1691 | 0.1669 | 0.1647 0.1626 | 0.1604 | 0.1582 |0.1561/0.1539|0.1518 
1.4/0.1497 | 0.1476) 0.1456 | 0.1435 | 0.1415 | 0.1394 | 0.1374 |0.1354/0.1334/0.1315 
1.5/0.1295 | 0.1276 | 0.1257 0.1238 | 0.1219 0.1200 | 0.1182|0.1163|0.1145|0.1127 
1.6/0.1109/ 0.1092) 0.1074 | 0.1057 | 0.1040 0.1023 | 0.1006 | 0.0989 | 0.0973 | 0.0957 
1.7/ 0.0940 | 0.0925 | 0.0909 | 0.0893) 0.0878 | 0.0863 | 0.0848 | 0.0833 | 0.0818] 0.0804 
1.8/ 0.0790 | 0.0775 | 0.0761 | 0.0748 | 0.0734 | 0.0721| 0.0707 | 0.0694 | 0.0681 | 0.0669 
1.9/ 0.0656 | 0.0644 | 0.0632 | 0.0620) 0.0608 | 0.0596 | 0.0584 | 0.0573 | 0.0562) 0.0551 
2.0/ 0.0640 | 0.0529) 0.0519 | 0.0508 | 0.0498 | 0.0488 | 0.0478 | 0.0468 0.0459 |0.0449 
2.1/0.0440 |0.0431 0.0422 0.0413 | 0.0404 |0.0396 | 0.0387 | 0.0379 | 0.0371 | 0.0363 
2.2 0.0355 | 0.0347 0.0339 0.0332| 0.0325 | 0.0317 0.0310 0.0303 0.0297 | 0.0290 
2.3/0.0283 | 0.0277 | 0.0270 | 0.0264 | 0.0258) 0.0252 | 0.0246| 0.0241) 0.0235 | 0.0229 
2.4) 0.0224] 0.0219 | 0.0213 | 0.0208 | 0.0203 | 0.0198 | 0.0194 | 0.0189 |0.0184| 0.0180 
2.5) 0.0175 | 0.0171 | 0.0167 | 0.0163 0.0158 | 0.0154 0.0151 | 0.0147|0.0143|0,0139 
2.6/ 0.0136 | 0.0132 | 0.0129) 0.0126) 0.0122 |0.0119 0.0116|0.0113/0.0110|0.0107 
2.7/0.0104 | 0.0101 0.0099) 0.0096 0.0093 | 0.0091 | 0.0088 | 0.0086 | 0.0084 | 0.0081 
2.8) 0.0079 | 0.0077 | 0.0075 | 0.0073 | 0.0071 | 0.0069 | 0.0067 | 0.0065 |0.0063| 0.0061 
2.9 0.0060 | 0.0058 | 0.0056 | 0.0055 | 0.0053 | 0.0051 | 0.0050! 0.0048 | 0.0047 | 0.0046 
3.0| 0.0044 | 0.0043 | 0.0042 | 0.0040} 0.0039 | 0.0038 | 0.0037 | 0.0036] 0.0035 | 0.0034 
3.1) 0.0033 | 0.0032 | 0.0031 | 0.0030 | 0.0029 | 0.0028 | 0.0027 | 0.0026 | 0.0025 | 0.0025 
3.2) 0.0024 | 0.0023 | 0.0022 | 0.0022/ 0.0021 | 0.0020 0.0020 | 0.0019 | 0.0018 | 0.0018 
3.3| 0.0017 | 0.0017] 0.0016 | 0.0016 0.0015 | 0.0015 | 0.0014 | 0.0014 | 0.0013) 0.0013 
3.4) 0.0012) 0.0012) 0.0012/ 0.0011] 0.0011 | 0.0010 0.0010 |0.0010 | 0.0009 | 0.0009 
3.5| 0.0009 0.0008) 0.0008] 0.0008 | 0,0008 0.0007 | 0.0007 | 0.0007 | 0.0007 | 0.0006 
3.6 0.0006 | 0.0006 | 0.0006 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0004 
3.7) 0.0004 | 0.0004 | 0.0004 | 0.0004 | 0.0004 | 0.0004 | 0.0003 | 0.0003] 0.0003 | 0.0003 
3.8| 0.0003 | 0.0003 | 0.0003 | 0.0003 | 0.0003 | 0.0002 | 0.0002 0.0002 0.0002! 0.0002 
[3.9] 0.0002 | 0.0002 | 0.0002 | 0.0002 | 0.0002 | 0.0002 0.0002 | 0.0002 | 0.0001 | 0.0001 


Statistical Tables (403 3 
Critical Values for Student's t Distribution 
-— 


Level of significance for one-tailed test 
f T T 
df | 0.10 0.05 | 0.025_| 0.01 0.005 0.0005 
v | | mil — 
| Level of significance for two-tailed test 
k 
| 0.20 0.10 0.05 0.02 0.01 0.001 
1 | 8.078 6.314 12.706 31.821 63.657 636.619 
2 1.886 2.920 4,303 6.965 9.925 81.598 
8 | 1.638 2.358 3.182 4,541 5.841 12.941 
4 | 1,538 2,182 2.776 3,747 4.604 8.610 
5 1.476 2.015 2,571 3.365 4.082 6.859 
6 1.440 1.948 2,447 3.143 8.707 5.969 
“f 1.415 1.895 2.365 2,998 8.499 5.405 
8 1.897 1.860 2,306 2.896 8.355 5.041 
9 1.383 1,833 2.262 2.821 3.250 4.781 
10 1.372 1.812 2.228 2.764 3.169 4.587 
11 1.363 1.796 2.201 2.718 3.106 4.437 
12 1.856 1.782 2.179 2.681 8.055 4.318 
13 1.350 1.771 2.160 2.650 8.012 4,221 
14 1.345 1.761 2.145 2.624 2.977 4.140 
15 1.341 1.753 2.131 2.602 2.947 4.073 
16 1.337 1,746 2.120 2.583 2.921 4.015 
17 1.333 1.740 2.110 2.567 2.898 3.965 
18 1,330 1.784 2.101 2.552 2.878 8.922 
19 1.328 1.729 2.098 2.539 2.861 3.883 
20 1.3825 1.725 2.086 2.528 2.845 3.850 
21 1.323 1,721 2.080 2.518 2.881 3.819 
22 1,321 L717 2.074 2.508 2.819 8.792 
23 1.319 1.714 2.069 2.500 2.807 8.767 
24 1.318 1.711 2.064 2.492 2.797 8.746 
25 1.316 1.708 2.060 2.485 2.787 3.725 
26 1.315 1.706 2,056 2.479 2.779 3.707 
27 1.814 1.708 2.062 2.473 2.771 3.690 
28 1.313 1.701 2.048 2.467 2.763 3.674 
29 | 1.311 1.699 2.046 2.462 2.756 3.659 
30 | 1.810 1.697 2.042 2.457 2.750 3.646 
40 | 1.803 1.684 2.021 2.423 2.704 3.651 
60 } 1,296 1.671 2.000 2,390 2.660 3.460 
120 | 1.289 1.658 1,980 2.358 2.617 3.373 
a | 1.282 | 1.645 1,960 | 2.326 2,576 3.291 
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Critical Values for the Chi-square (2) Distribution 


a 
df : ; 

v 010 | 0.05 0.025 | 0.02 | 0.01 0.005 | 0.001 
ee a706 | 3.841 5024 | 5412 | 6635 | 7.879 | ot 
2 4.605 5.991 7.378 | 7.824 9.210 | 10.697 | 18.816 
3 6.251 7.815 9.348 | 9.887 | 11.345 | 12.838 | 16.268 
4 7.179 9.488 | 11.143 | 11.668 | 13.277 | 14,860 | 18.465 
5 9.236 | 11.070 | 12.882 | 18.888 | 15.086 | 16.750 | 20.517 
6 10.645 | 12.592 | 14.449 | 15.033 | 16.812 | 18.548 | 22.467 
7 12,017 | 14.067 | 16.018 | 16.622 | 18.475 | 20.278 | 24.822 
8 18.862 | 15.507 | 17.585 | 18.168 | 20.090 | 21.966 26.125 
9 14.684 | 16.919 | 19.028 | 19.679 | 21.666 | 23.589 | 27.877 
10 15.987 | 18.307 | 20.483 | 21.161 | 23.209 | 25.188 | 29.588 
1 17.275 | 19.675 | 21.920 | 22.618 | 24.725 | 26.757 | 81.264 
12 18.549 | 21.026 | 23.337 | 24.054 | 26.217 | 28.800 | 32.909 
13 19.812 | 22.362 | 24.736 | 25.472 | 27.688 | 29.819 | 34.528 
4 21.064 | 28.685 | 26.119 | 26.873 | 29.141 | 31.819 | 36.123 
15 22.307 | 24.996 | 27.488 | 28.259 | 80.578 | 32.801 | 37.697 
16 29.542 | 26.296 | 28.845 | 29.633 | 32.000 | 34.267 | 39.252 
17 24.769 | 27.587 | 30.191 | 30.995 | 83.409 | 35.718 | 40.790 
18 25.989 | 28.869 | 31.526 | 32.346 | 34.805 | 37.156 | 42.312 
19 27.204 | 30.144 | 82.852 | 32687 | 36.191 | 38.582 | 43.820 
20 28.412 | 31.410 | 34.170 , ‘35.020 | 37.566 | 39.997 | 45.316 
21 29.615 | 32.671 | 35.479 | 36.343 | 38.932 | 41.401 | 46.797 
22 30.818 | 33.924 | 36.781 | 37.659 | 40.289 | 42.796 | 48.268 
23 32,007 | 36.172 | 38.076 38.968 | 41.638 | 44.181 | 49.728 
24 83,196 | 36.415 | 39.364 | 40.270 | 42.980 | 45.568 | 51.179 
25 34.882 | 37.662 | 40.646 | 41.566 | 44.314 | 46.928 | 52.620 
26 35.563 | 38.886 | 41.923 | 42.856 | 45.642 | 48.290 | 54.052 
27 36.741 | 40.113 | 43.194 | 44.140 | 46.963 | 49.645 | 55.476 
28 37.916 | 41.337 | 44.461 | 45.419 | 48.278 | 50.993 | 56.893 
29 39,.087°| 42.557 | 45.722 46.693 | 49.588 | 52.336 | 58.302 
30 40.256 | 43.773 | 46.979 | 47.962 | 50.802 53.672 | 59.703 | 


